ON DIVISION ALGEBRAS* 


BY 


J. H. M. WEDDERBURN 


$1. The object of this paper is to develop some of the simpler properties 
of division algebras, that is to say, linear associative algebras in which division 
is possible by any element except zero. 

The determination of all such algebras in a given field is one of the most 
interesting problems in the theory of linear algebras. Early in the develop- 
ment of the subject, Frobenius showed that quaternions and its subalgebras 
form the only division algebras in the field of real numbers and, with the ex- 
ception of the single theorem that there is no non-commutative division algebra 
in a finite field, no further definite result of importance was known till Dickson 
discovered the algebra referred to in § 4. 

It is shown in the present paper that the Dickson algebra is the only non- 
commutative algebra of order 9 so that the only division algebras of order not 
greater than 9 are (i) the Dickson algebras of order 4 and 9, (ii) the ordinary 
commutative fields, (iii) algebras of order 8 which reduce to a Dickson algebra 
of order 4 when the field is extended to include those elements of the algebra 
which are commutative with every other element. 

§ 2. Lemma 1. Jf B is a subalgebra of order b in a division algebra A of 
order a, there exists a complex C of order e such that 


A = BC, a = be. 


Denoting elements of B by y with appropriate suffixes, let 22 be an element 
of A which does not lie in B; the order of the complex B + Bye is then 2b as 
otherwise there would be a relation of the form / + yo * = 0, (% +0), 
which would lead to x2 = — ½ f < B. Similarly, if 23 < B+ Bao, the 
order of B + Bas + Bez is 3b since otherwise there would be a relation of the 
form / + Ye 2 + ¥3%3 = 0, (ys + 0), which would lead to 


t= — Yi — Yr * < B+ Bar. 


Since the basis of A is finite, the truth of the lemma follows by an easy in- 
duction. 
* Presented to the Society, February 28, 1920. 
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£ is divided on the right and left by & — b, the remainders are ay b" + a,b" 
+a, and b” ay ay + +++ 4 dy respectively. 

The proof of this lemma is exactly the same as in ordinary algebra, due 
care being taken to distinguish between multiplication on the right and on 
the left. 

A factor which divides a polynomial on the right (left) will be referred to 
as a R. F. (L. F.). 

Lemma 3. / 


Lemma 2. If a polynomial* ag + a, + + a, ina scalar variable 


A = ilo + ay — + + Cm 
and 


B= bo +b, . 


are polynomials in a scalar variable &, there exists a highest common right-hand 
factor (H. C. H. F.) Ci and a highest common left-hand factor (H.C.L.F.) C and 
polynomials Li, My, La, Mz such that 


Li A.+ M, B = Ci, Al, + BM, = 2. 


If n = m, we ean, by right-hand division, determine polynomials Oi and R, 
in ? such that A = Q, B + Ri, where Ni is of lower degree in & than B. 
Obviously any C. R. F. of B and HN is a R. F. of A; we can therefore proceed 
with the proof exactly as in ordinary algebra. 

The theory of linear factors of-a polynomial in a sealar variable is by no 
means so simple as in commutative algebras. Their properties depend mainly 
on the following considerations. Let 4 = BC be a polynomial in & expressed 
as the product of two polynomial factors B and C, and suppose that & — x isa 
right factor of A but not of C; we have then C = Qi (& — x) + R, where Q, 
is a polynomial and R is independent of & and is not zero. Multiplying by B 
we get 4 = BQ: (£ — x2) + BR, whence € — is a R. F. of BR so that we 
can set BR = (E ) or B= Q2.R'(E — RrR“), ie., 8 — is a 
R.F. of B. 

A ease of some importance arises when the algebra is quadrate i.e., where 
scalars are the only elements commutative with every element of the algebra, 
and when A = 0 is the reduced equation of this algebra. Regarding these 
algebras we have the following 

Lemma 4. If (i) = 0 ts the reduced equation of an element of a quadrate 
division ulgebra, then, if p is the degree of &, the scalar polynomial & () can be 
expressed rationally as the product of p linear factors which may be permuted 
cyclically. 

Since ¢ (21) ¢ — x, is both a right and a left factor of 6 (?), and so 

Throughout this paper all elements such as ao, ai, «++ are to be considered as belonging 
to a division algebra unless the contrary is stated explicitly. 
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is also k — 2’ if 2’ is any transform of 1. Let @(£) = (E — 41), then, 


if x’ is a transform which is not equal to v, we have 

(E) 3 (FS — 21) = B(E— 2’) 24); 
hence, as above, — 2 = — (A“ — i) (K — 41071 is a R. F. 
Similarly if B = ! (E — 2), and x” 
is not a R.F. £ — 41), we find as above that 


T3 = — Rr“ NI R = 2 — + 1) 4“ vot] 0, 


is a transform of 2; such that 


is a R.F. of B’; and so on. Continuing this process we get finally 


(8) = C(é Ym) (& (8 42) (8 11) = CD (m 290. 


where, if / is any transform of 2,, then ? — y is a R. F. of D. If therefore 


D En ＋ ai S ' 4+ --+ + am, then 
(1) y” + + On = 0 


for every transform / of. If the a’s are not all scalar, let z be an element 
which is not commutative with at least one of them and let a; = za; 27 
Since (1) is satisfied by every transform of 21, it follows that every transform 
also satisfies 


(2) + ym a = 0 


in which ‘at least one coefficient differs from the corresponding coefficient 
in (1). Subtracting (1) from (2), we get therefore a new equation of lower 
degree than m which is not identically zero and which is satisfied by every 
transform of 2), say 


(3) + Bi + --- +8, = 0. 


If the 6’s are not all scalars, the degree can again be lowered by a repetition 
of this process, till finally an equation is reached with scalar coefficients not 
all zero; we can therefore regard the 8's as scalars without loss of generality. 
Since however the identical equation is irreducible, the left-hand side of (3), 
with y replaced by & must be divisible by & (E) whence it follows immediately 
that G = D m = p and 


60 = (& — fp) (E — 21) (E 1). 


The linear factors are permutable cyclically since their product is a scalar. 

The necessary modifications in the lemma when the algebra is not quadrate, 
will be obvious after the proof of theorem 1 below. 

$3. Tueorem I. If its field, V, be suitably extended, any division algebra, A, 
can be expressed as the direct product of a commutative algebra, B, and a simple 
matric algebra. B is composed of all elements of A which are commutative with 
every other element, and its basis may be so chosen as to be rational in F. 
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It has been shown elsewhere“ that a division algebra, A, of order a, reduces 
to the direct sum of a number of simple matric algebras when the field is 
extended by the adjunction of a finite number of suitably chosen algebraic 
irrationalities. In this extended field, FP’, we may therefore write 


A=Ayt Az 4 + Ap, 


where 


It is then obvious that the algebra, A, whose basis is e; = LV (i = 1,2, 
-++,b) is composed of all elements of which are commutative with every 
element of .f. By expressing the basis of B in terms of any rational basis, 


any element y of B can be expressed in the form y = >> &;2;, where the 2's 


are rational elements of .1, not all zero, and the £'s are marks of F’ which are 
linearly independent in F. If now x is any rational element, we have, from 


the definition of B, ry = yx; hence 


O = ay — yr = DoE ax), 


and therefore, since the S's are linearly independent in F, it follows that 
vey — = for every x; and The elements of the subalgebra generated 
by the elements r are therefore commutative with every element of Hand this 
algebra, which is rational, is equivalent to Bin F’. 

If we extend the field F so as to include the elements of B, we get a new 
division algebra, 1’, of order c and when this field is again extended, A’ 
reduces to a matric algebra which is simple, as otherwise B would not contain 
all elements commutative with every element of .1. It follows immediately 
that all the algebras I., -l2, «++, 1, have the same order a/b and that, in 
F’, A is the direct product of a simple matric algebra C and the commutative 
algebra B. 

If B reduces to the identity, A is said to be quadrate: its order is a square, 
and sealars are the only elements commutative with every element of the 
algebra. 

Turorem II. Lf a division algebra, A, contains a quadrate subalgebra, Ii, 
it can be expressed as the direct product of B and another algebra C. 

if F’ is the field F so extended as to render B reducible to the simple matric 
form, then it is known that can be expressed as the direct product of B 
and an algebra C which contains all elements of A which are commutative 
with every element of B. Any element z of C can be expressed in the form 

Doe. where the 2's are rational elements of A and the 8 


s 


s are marks 


*Proceedings of the London Mathematical Society, Vol. 6 (1907), p. 102. 
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of F’ which are linearly independent in F. If y is any element of B which is 
rational in F, then yz = zy; whence 


0 = yz — zy = LE. (. — 2. ), 


so that, as before, yx, = x, for every y of B. Hence the elements , belong 
to ( which has therefore a basis rational in F. 

Tueorem III. Jf A is a quadrate division algebra whose order, , is the 
square of a prime, then the adjunction to the field F of any irrationality which 
renders any number of A wholly or partially reducible also reduces A to a matric 
algebra; or, in any field in which A is not primitive, it is equivalent to a matrie 
algebra. 

This theorem follows immediately from theorem 1 and the theorem that 
any simple algebra is the direct product of a division algebra and a simple 
matric algebra, the latter being necessarily of order p* since p is a prime. 

$4. The only types of division algebras hitherto discovered all come under 
the type described by L. E. Dickson.* These algebras are defined by the 
relations 

ry = yO(x), y" g, 
where the identical equation of x is a uniserial Abelian equation of degree n, 
and g is an element of the field of the coefficients which is not the norm of any 
rational polynomial in x. The following investigation serves to show the 
manner in which these algebras arise. 

Let x be an element of a division algebra A in a field V, and let the order 
of the algebra, M, generated by x, be m, then by lemma I we can determine a 
complex Z = (21, 22, „ n) such that A = 21 ＋ +---+ Vz, 
where 1 is of order mn. Every element / of A can therefore be expressed in 
the form y = >> g, e, where the g’s are polynomials in x, and in particular we 
may set 


= 911 21 + 912 22 + 2 
= 921 21 + 922 22 9 n 2 


Zn = gal 21 + 9% Ze + + Gan 2n- 


If is the matrix (%), whose coefficients lie in the field F (x), these equa- 
tions may be put in the form 


(21, „ )r= 
and it follows immediately that 


(zi, 
Cf. Dickson, these Transactions, vol. 15 (1914), p. 31, and Wedderburn, ibid., 
p. 162. 
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Hence, if m = n, the matrix @ satisfies the same identical equation as æ and 
in any case G satisfies an equation of degree n whose coefficients may however 
contain &.“ 

In the field F (2), x is always a root of this equation, the corresponding 
invariant axis being the modulus of the algebra. If now the identical equa- 
tion of G is abelian, its roots are polynomials in a which are rational in F, 
say 0, (Y) (T= 1, 2. „ (r) =a), and to each root there corre- 
sponds a rational element of the algebra, say y,, such that 


Yr r = (r) yr; 


and this leads to Dickson's algebra when the abelian equation is uniserial. f 
$5. We shall now show that the Dickson algebra is the only quadrate 
division algebra of order 9. 

Let 2; be an element of such an algebra which is not commutative with 
any of its transforms so that its identical equation, 


is not abelian. By lemma 4, we can express f (£) in the form 
f(&) = (& — 3) (E 22) (E — 41) 

where the factors may be permuted cyclically and no two of the æ's are com- 
mutative.[ Since E — 22 is a right-hand factor of f (E), and 

* 2 = (2 — 4 (2 — 410 
leads to x; ®2 = n contrary to our assumption, therefore 
(5) 3 = (2 1 — T2) 42 (2 — 420 1 
and by symmetry, permuting the suffixes cyclically, also 

(Ti — ) (12 — 110, 


(z x2 — z ) v3 (23 T2 — Lo z 
But, from (4), 
a Xe — ＋2 41 — Ta T1 = 


hence, permuting cyclically, 


T2 — Vo T1 + T1 = + Vo T3 + I Tz = + T1 + Taz, 


It can be proved in the same way that, to every element y; of A, there corresponds a 
matrix Y;, whose coefficients are scalar polynomials in z, such that (z) ys = F. (2); and if y; 
is a second element of A and FV, the corresponding matrix, the matrix belonging to y; y; is 
F. Y; = ( Y; Y,). 

+ I have been unable to construct an algebra of this type which is not also a Dickson algebra 
i. e., one for which the equation is uniserial, but it appears probable that they exist. 

t As the roots of f() are necessarily distinct, any number commutative with 2; is a scalar 
polynomial in 
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whence 
z T1 — TI T2 = TI Tz — Ti = rz — Tz rz = , 
say, so that from (5) 


Therefore ö is commutative with 2; and, as y is not a polynomial in 2, it 
follows that / = hisascalar. We may then assume that the identical equa- 
tion of x, has the same form, say aj = g. 

Let now 21 = 21 „ 22 11 81 = xi then 


but, since v2 21 %3 = g, we have x2 1 = 23, so that 
= V - yry)/h, 
since y® = h, and therefore 21 22 — 22 21 = 0, i.e., 22 is a polynomial in 21 so 
that the identical equation is a uniserial abelian equation and the algebra is 
of Dickson’s type. 
The identical equation of 21 is easily found as follows: 


therefore 
21 they = ay: = i = hg. 
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OSCILLATION THEOREMS FOR THE REAL, SELF-ADJOINT 


LINEAR SYSTEM OF THE SECOND ORDER* 


II. J. ETTLINGER 
INTRODUCTION 


It is the object of this paper to determine the number of oscillations of a 
linear combination of the form (2) for the systems (3) and (4). From these 
results, an oscillation theorem for the solution uw, (2) , corresponding to the pth 
characteristic number of (4), is obtained. 

Given the second order self-adjoint linear differential equation 


du 


df. | 
@) 


and two linear combinations of a solution which does not vanish identically 
and its first derivative, 
(2) (A, X) N) u (, X) - B. (, X) K (, X) (, X) (i 
we shall impose the following conditions and shall assume that they are satis- 
fied throughout this paper: 
I. K (æ, X), 6(æ, X), 8B. (æ, X), (, X), Biz A) are 
continuous, real functions of æ in the interval 
(A) (a S Sb) 
and for all real values of Xx in the interval 
(A) (Li H). 
II. K (, X) is positive everywhere in (X, A) and 
a. + 6. > 0 
in (X, A). 
III. For each value of x in (A), K and G decrease (or do not increase) 
* Presented to the Society, Sept. 6, 1917. 
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as X increases. In no sub-interval of (A) are K and simultaneously inde- 
pendent of X and in no sub-interval of (A) is identically zero. 
IV. Either 8. = 0 for all values of x and Xx in (A, A); or else 6; + 0 in 

a < ax <b for all 's in (A) and one of the following is true, 

either (a) 8, (4) = 0 for all X’s in (A), 8. (b) + 0, — a: (5%. (0) 
decreases (or does not increase) as X increases; 

or (b) B;(b) =0, B;(a) + 0 for all x's in (A), a; (a)/8; (a) 
decreases (or does not increase) as X increases; 

or (e) 8. () +0, 6. (0) + 0, a: (a)/B;(a) and — a;(b)/8;(b) 


decrease (or do not increase) as X increases. 


min G 
lim — @, 
min K 
max G 
lim 


I. THe STURMIAN SYSTEM 


Concerning the system 


d du 
(3) K(2,a) o, 


Ii[u(a,r)] = O, = O, 


STURM’S OSCILLATION THEOREM} may be stated with Béchert substantially as 
follows: . 
The system (3) satisfying conditions I-V has an infinite set of characteristic 
numbers such that 
SL; CXO C RIC < 


and U(x, X/), the pth characteristic function, vanishes exactly p times on 
a<z<b. 

We seek to determine the number of oscillations of LI U (, X)]. We 
notice first that if 81 = 0 for all values of 2 and Xx in (X, A), then all the 
zeros of U (, X.) and LiII U (2, \,)] coincide, since ai (, X) +0. Hence 
we have for 81 = 0 precisely p zeros of Lil LU (, My) ] ona <a <b. 

With Boécher§ we define 


a) 


(ai Bi} = Bi a, — ag +7 — BiG. 


This condition may be replaced by other sets of conditions. See Bécher, Legons sur les 
Méthodes de Sturm (hereafter referred to as Legons) (1917), chap. III, paragraphs 13-15. 
Journal de Mathématiques pures et appliquées, vol. 1 (1836), 
. 106 ff. 
t Lecons, p. 63 ff. 
§ Legons, p. 45. 
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Let 81 + 0 in (X, A), and consider the zeros of U (x, X) and Lil U (x, X)]. 
For Xx = Xo, LII L (a, = O and LII U (b, xo) = 0, but U(x, Ao) does 
not vanish on (A). Hence {a; BI must vanish for some value of x in 
(A), since if (an < 0 throughout (A), III L (, No)] could vanish 
but once in (A);“ and if (ar BIE > 0 for every x in (A), U (a2, No) would 
vanish once in (X) r If fa: 8:1} > 0 for X = Ri for all values of 2 in (A), 
then the zeros of U (, X)] and U(a2,X) separate.t But U(a,X,) van- 
ishes exactly p times on a <a <b. Hence Lil U X)] vanishes p — 1 
times ona <x<bforp=1. 

If 81 (, X) =O in (A), Bi (, X) + O fora gr b, then aq (a) + Oin 
(A). Then the zeros of U (, Mi) ] and U (x, Xi) separatet each other 
ona c b, provided (ar BH, > 0. Leta be the first zero of U (a, My) 
on dq h. Then 

Lil U (xn, Ny)! — Bi (r, Ny) K(x, Ny) UD. Ay). 
Now direct computation shows that 


a, (a) 


But A(a)U,(a) and K (ri, X/.) U. (21, Ap) have opposite signs, and 
sgn Gi (Ai, = sgn 6100). f Hence 


sgn Lil U (i. N)] = sgn Lil U (a)] sgn (U) sgn ai (a). 


Thus III U (x, X)] vanishes or does not vanish in a < x < 2 according as 
a, (a) 8; () is negative or is positive respectively. Hence if a;(a)6,(b) >0, 
Lil U (x, has p—1 zeros on a<a<b, and if a,(a)6i(b) <0, 
Lil U (x, Xi) ] has p zeros ona <x <b, forp21. 

If (U, X) = Oin (A), (, X) + for a = 2 < b, then a similar argu- 
ment can be made with the following result: if Lil U (2, X.)! 
has p — 1 zeros ona <2 <b, and if ai (b) (a) < 0, III U (, has p 
zeros ona <2<b,forp21. Therefore 

OSCILLATION THEOREM I. Jf U (, xy) is the pth characteristic function of 
the system (3) satisfying conditions I-V and if {a, 81} O for R = XI for every x 
in (A), then Lil U (x, Xi) I will vanish ona <x <bforp=1, 

p times if 

either 81 = Oin (X, A); 

or Bi(a) = O in (A), GI (, X) 0 in 4 
or 8106) =Oin (A), (, X) O in a 


p — 1 times if 


x S band ai (a) <0; 
x <band ai (b) (a) <0; 


IA A 


* Bocher, Legons, p. 51. 
tT Ibid., p. 50. 
tsgn f = sign of f. 
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either 81 + Oin (X, A); 
or (a) = O in (A), (r, X) Oina <x S hand an (a) B,(b) > 0; 
or 81 00) = O in (A), G (r, X) Oina S < and a () G (a) > 0. 
Let N) (A, X) — (&, X) K (, X) (, X), where 
a, G, ar, B, are continuous in (X, A). Then we may state 
OsciLLXTION THEOREM II. IF U is the pth characteristic function 
of (3) satisfying conditions I, where {a; Bi} > O, {a8} > O for S R and 
(a, B aB,) + O in (A, A), then LIU (x, Mi) ] will vanish ona<ax<b, 
21. 
p times if 
either 81 + Oin (X, A); 
or (a) = O in (A), GI (, X) + Oina 
or By(b) in (A), GI (, X) + Oina 
p + 1 times if 
either By O in (X, A); 
or 6104) O in (A), GI (r, X) Oina <x S band ai Bi(b) <0; 
or Bi(b) = in (A), G1 (, X) Oina < band ai Bi(a) <0. 
Proof: The zeros of U (, Mi) ] and LIU (2, x)] separate one another 
on (A).“ But III U (a, x,)! =0 and III L O. Hence 
LIU (A, Xx.) ] has one more zero on < x < h, for I, than LII U (, Ny) ]. 
This proves Theorem II. 


x = b and ai (a) 81050) > 0; 
r and (b) 0; 


IA A 


II. THE GENERAL SELF-ADJOINT SYSTEM 


Consider the system 
l | 
9 45 A) u 0, 
(4) dx dx 
Li[u(a,r)] = (CV, X)], La[u(a,r)] TzIu (b, X)], 
where Li and Lz are defined as in (2) and 82: # Of in (A, A). We impose 
further conditions: 


VI. fa; +0 for X XI, 
VII. Bo — 2 By — 1 in . 
VIII.: ai(a) Bila) 61 (05) 


() (4) qa () 82 (0) 
Aai(a) A661 (4) () 461 (0) 
Aq (d) (4) Aq (Y) 482 (0) 
Boécher, Legons, p. 50. 
+ This restriction does not involve a loss of generality, since if 6: =0, 82 = 0 then we 
may interchange Li and Le. 
taf =f(’+ Ad) —f(A) for XO. 
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In a recent paper* the writer proved the following theorem concerning the 
system (J), satisfying conditions I-VIII: 

There exists one and only one characteristic number of (4) between every pair 
of characteristic numbers of the Sturmian system (3). If Xp represents the 
ordered characteristic numbers of (3) and lL, those of (4) (account being taken of 
their multiplicity) then 

Case I. 1, is in the interval (Xp, Apis) Uf Lz IU (b, xo)! (Lie) > 0.F 

Case II. 1, is in the interval (r, Ap), p 2 1, i 

<9. 

Let u½% (r = u(x,/,) be the pth characteristic function of (4). We pro- 
ceed to consider the number of oscillations of L,[u,(a2)]. We notice first 
that if \ = /, is a double characteristic number, then /, coincides with ME, 


Ny, or Mir, and Ly [ wu, (2) ] will have the number of oscillations designated by 


Theorem I. 

If X =./, is a simple value, we may discriminate between Case I and Case II, 
following a method due to Birkhoff.t If 6; 0 in (X, A), the sign of 
L2.[U (b, xo) ] is the same as — a; (a) a ,), which is negative, since a; + 0 
in (X, A). Hence Lo[U(b,Xo)] is negative. Also & (Li + €) has the sign 
of — B.(a)B.(b), but 82 +0 in (X, A). Hence ¢(£; + €) is negative, 
and we have Case I where /, is on the interval A, I X MI. By The- 
orem I, L,{U (2,X,)] vanishes exactly p times on a < x < for v I, and 
Lil LU (x,Xpi1)] vanishes p+1 times on a<a<b for p21. But 
Lil U CU, X) 0 for Xx XC Hence U (x, X)] vanishes p + 1 
times on a <x <b for x <A <Apii. But the roots of LII U (æ, X)] and 
(, separate one another, and U (a, x)]! = O. Hence 
Lilu (, X)] vanishes p + 1 or p + 2 times ona < / h for M CX < Apis, 
p21. Hence LII % (r)] vanishes either p + 1 or p + 2timesona <a <b, 
p=1. But from (4) the number of zeros of L;[u,(2)] is always even. 
Therefore we have p + 2 roots if p is even and p + 1 roots for p odd, p= 1. 

If 81 (4) = 0 in (A) and 81 05) + 0, the sign of Lz I L (b, Xo)!] is that of 
a, (a)/B;(b), and @(£; + €) has the sign of 8;(6)/a;(a). Hence we have 
Case I where % is on the interval A, <A <Apii. If ar (4) Bi(b) < 0, by 
Theorem I, LII U (2,\,)] vanishes p times ona < x < band Lil U (x, Xii)! 
vanishes p + 1 times on a <2 <b. Reasoning exactly as above, we find 
that ¢i[u,(a)] vanishes p + 2 times on a < 2 <b if p is even and p+ 1 
Existence Theorems for the General, Real, Self-Adjoint Linear System of the Second Order, 
these Transactions, vol. 19 (1918), p. 94. 

+ (N) = 0 is the characteristic equation of (4) whose roots are the characteristic num- 
bers, /,. Li Te is a value of Xx in (A) near Li. 

t Existence and Oscillation Theorem for a Certain Boundary Value Problem, these Trans- 
actions, vol. 10 (1909), pp. 259-270. 

§ Bécher, Legons, p. 48. 
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times if pisodd,p 21. If ai (a) Bi (b) > 0, by Theorem I, LII L (, 
vanishes p — 1 times ona < x < b, and Lil U (2, Xii)! vanishes p times on 
a But LII U (b, X)] + OfordA, XC and (a, x)] = 0. 
The roots of LI I U (, X)] and (, X)] separate one another. Hence 
Li ()] vanishes either p or p +1 times ona<a<b, p=1. But 
from (4) the number of zeros of L,[u,(2)] is always even. Therefore we 
have p roots if p is even and p + 1 reots if p is odd. 

If 81 (5) = 0 in (A) and (a) + 0, the sign of L2[U(b,Xo)] is that of 
— Bi(a)/a,(b), and G (Li + €) has the sign of — 6,(a)/a,(b). Hence 
we have Case again. If ai (b) Bi (a) < 0 we proceed as before and obtain 
p + 2 zeros of Li lu. ()] ona <2 <b if p is even and p + 1 zeros if p is 
odd, p21. If ar (b) Bi (a) >0, we obtain p zeros of LII u ()] on 
a<a<bif pis even and p + 1 zeros if p is odd, p= 1. 

If 61 + 0 in (AX, A), the sign of L.[U(b, Xo)] is that of (a)/8,(b), 
which is positive. The sign of & (Ai + €) is that of 8: (a) 62 (0) — 82 () 61 00) 
if 81 (a) 62 (5) — 52 (4) () does not vanish. If 

Bi (a) 62 (0) — 62 (4) 61 00) = 0, 
the sign of @ (£, + €) is that of G1 () Bi (), which is positive. Accordingly, 
if 81 (a) 62 (U) — 62 () Bi (b) S O, we have Case I, and is on <A < 
By Theorem I, LII U (a, My) ] vanishes p — 1 times, and, proceeding as above, 
we find that LI II/ ()] has p zeros if p is even and p + 1 zeros if p is odd, 


p=1. Likewise, if 81 () 62 (U) — 62 (4) 61 (0) 0, is on <A Ry, 
and LIUI＋ ()] will vanish p times if p is even and p — 1 times if p is odd, 
oii. 

Summarizing the various kinds of coefficients of the boundary conditions 
of (4) as follows: 


A: 61 = O in (A, A); 
B-: Either 81 () = O in (A), 61 (, N) / O in a <a Sb, and 
() ; 
or 61 (0b) in (A), 61 (æ, X) O in aS <b, and 
() (CA) <0; 
Either 81 () = 0 in (A), (, X) in a S b, and 
() Bi(b) > 0; 
or 81 (b) = Oin (A), Bi (, X) + O in a S b, and 
ai(b)Bi(a) > 0; 
Ct: By + in (A, A) and 81 (4) Bo(b) — 62 (4) Bi (b) = 0 in (A); 
(: By + O in (X, A) and (4) B2 (0) 52 (4) Bi (b) < Oin (A); 
we state 
OSCILLATION THEOREM III. If u (r) is the pth characteristic function 
corresponding to a simple value of the system (4) satisfying conditions I-VIII, 


4 
A 
x 
75 
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then the number of zeros of Li{u,(a)] on a <a <b for p= 1 is given by the 
following table: 
Case Number of zeros 
p = 2m p=2m+1 
A or B pt+l 
B or ＋ 1 
ct p-l. 


Furthermore, we notice that, since a; Bs — az B; + O in (X, A), the zeros 
of LIIUr (r)] and Le[u, (a) ] separate one another on X, and, if X = /, isa 
double value, the number of zeros of L2[u,(a)] is given by Theorem II. If 
= isa simple value, both LI and L2[u,(a)] by (4) have an even 
number of zeros on (X). Hence they oscillate the same number of times. 
Therefore 

Corotiary.. The number of zeros of ona <a <b, for p=1, 
is precisely the number given in the table of Theorem III. 

From the foregoing results we may deduce the number of oscillations of 
1% (). If X =1, isa double value, then u, () will differ from U (a, Xi), 
(r, X/), or U(x, Xii) by at most a non-vanishing factor, and the number 
of zeros of % () will be given by the Sturmian Oscillation Theorem. 

For a simple value we consider the following cases: 

If 8; = 0 in (A, A) the zeros of ½ (r) and L,[u,(2)] coincide. Hence 
1% (x) has p + 2 zeros if p = 2m and p + 1 zeros if p = 2m +1. 

If 8; + Oin (X, A) or 81 (a) = 0, 61 (50) + 0 or 81 060) = 0, Bi (a) +O, 
we write the first boundary condition of (4) 

1% (c) () = * P(b), 
where 
PI () = (a) - Bila) K 

If PI () PI (U) O, %½ () ½% will be positive and (r) has an even 
number of roots. But the zeros of ½ (r) and LII u ()] separate. Hence 
1% (x) and Lil ()] have the same number of zeros on a<a<b. If 
Pi () P (U) <0, u,(a)u,(b) will be negative and u,(2) has an odd 
number of zeros. Hence ½ () will have one more or one less zero than 
Li Ius (4) Jon d h. 

It is also to be noticed that in Case I, since /, is on (Ap, Ni), % () can 


vanigh not more than p + 2 times nor less than p — 1 times. In Case II, 
since /, is on (Ap-1, Ap), Up (a) can vanish not more than p + 1 times nor 


less than p — 2 times. f 
Designating the condition PI ()-: Pi (h) >0 by P and the condition 
P,(a)-P,(b) <0 by P, we may state 


* Bécher, Legons, p. 50. 
+ Cf. Cor., p. 96 of the paper of the author referred to above. 
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OSCILLATION THEOREM IV. If ½% () is the pth characteristic function cor- 
responding to a simple value of the system (4) satisfying conditions I-VIII, 
then the number of zeros of u,(x) ona<a <b, for p= 1, is given by the 
following table: 


Case Nu mbe r of zeros 


A 2 41 

B~ p+ 2orp 

Bt P 
pt+.lorp—1 p+2orp 

p ＋1 
ptlorp-—1 p+2orp 

Cr 7 Pp — 1 
CF ptlorp—1 porp-—2 


Note: if in particular we choose 


b — x)aj(X) + (a—a)yil(A) 


— 4 


J. (æ, X) = (b — x) Bi (A) + (a — 4) (X) 


a; (, X) 


5 — 4 


the system (4) becomes identical with that of (4) of the paper by the writer 
to which reference has been made above. If 6; 6; > 0, it will be necessary to 
modify the boundary conditions of (4) by taking 
Li{u(a,r)] = — Ljfu(b,r)] 

with condition VII replaced by 

a;(a)Bo(a) — () By (a) = () Bi - a (b) 52 (b) = 1, 
if only one of the boundary conditions is modified. If both boundary condi- 
tions are modified, condition VII remains unchanged. In either case condi- 
tions I-VI and VIII will be satisfied. The oscillation theorems will be true 
with the modification that LI ()] will have an odd number of zeros. 
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NEW PROOFS OF CERTAIN FINITENESS THEOREMS IN THE 
THEORY OF MODULAR COVARIANTS* 
OLIVE C. HAZLETT 


1. Introduction. In a recent paper of minef it was proved that every 
modular covariant of a system of forms S (with variables 2 and /) is a poly- 
nomial in the universal covariant and modular invariants of the system of 
forms S (with variables S and 7) enlarged by the linear form nx — F/ which 
have been made formally invariant as to x and y. As pointed out in that 


paper, we have as a corollary the following: 

If N is the class of all modular concomitants of the system & which are formally 
invariant as to certain sets of coefficients and variables, but not formally invariant 
as to and, then the theorem tells us how to construct the set K“ of all modular 
concomitants which are formally invariant as to x and y in addition to being 
formally invariant as to those sets of coefficients and variables with respect to 


which K is formally invariant. 

Here x and y may be the variables of the system S or a pair of variables 
which is cogredient with the variables of the system or even a pair of variables 
which is cogredient with the variables aside from a power of the determinant 
of the transformation. In fact, every modular covariant of the set A’ is a 
polynomial in L and the concomitants of the set A which have been made 
formally invariant as to and /. Inthe present paper we give a few extensions 


and applications of this theorem. 


$$ 2-3. NEW PROOFS OF SOME FINITENESS THEOREMS 

2. Finiteness theorem for modular covariants. Dickson has already shown 
that modular invariants from their very nature have the finiteness property.t 
He has also proved that modular covariants have the finiteness property. 
We now proceed to give a proof of the finiteness theorem for modular covariants 
ditectly from the finiteness theorem for modular invariants. 

* Presented to the Society, October 25, 1919. 

tA Theorem on Modular Covariants, these Transactions, vol. 21 (1920), p. 247. 

tGeneral Theory of Modular Invariants, these Transactions, vol. 10 (1909), 

126. 
§ Proof of the Finiteness of Modular Covariants, these Transactions, vol. 14 (1913), 


300. 
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For, in the first place, every modular covariant of a system S of forms is a 
polynomial in Land the modular invariants of the enlarged system S“ which 
have been made formally invariant as to æ andy. Now the modular invariants 
of S“ have the “ finiteness’ property—i.e., they are all expressible as poly- 
nomials in a finite subset. This is not, however, the same as saying that the 
modular invariants of S’ which have been made formally invariant as to x 
and y have the finiteness property, since any one modular invariant of S’ pro- 
duces a number of modular invariants which are formally invariant as to 
* and /., Let Ii, Iz, „I, be modular invariants of S“ which are congruent 
to I when x and y are in the field and are formally invariant as to x and y. 
Moreover, let Ii be such an invariant of the lowest possible degree, say d, in x 
and / let Iz be of degree d + p" — 1, if there be any such;* let J; be of 
degree d + 2(p" - 1), if there be any such; and finally let J, be of degree 
d+(p"—1)*. From the proof of the fundamental theorem of the paper 
mentioned in the introduction, every invariant of S’ of degree d which is 
formally invariant as to 2 and / and which is congruent to I when x and y 
are in the field is of the form J; + L/,, where J; is a modular invariant of S’ 
which is formally invariant as to x and y and is of a degree in x and y which is 
less than d. A similar remark applies to invariants of S“ which are congruent 
to J when z and y are in the field and which are of degree d + (p" — 1), ---, 
d+(p"—1)invand . 

We can readily construct an invariant of S’ which is of degree d + p" (p"—1) 
and is congruent to J when æ and y are in the field, for QJ; is such an invariant. 


Similarly, we can express every modular invariant of S’ which is formally 


invariant as to 2 and / and which is congruent to J whenever x and y are in 
the field as a polynomial in II, Jz, , I., L, Q and modular invariants of 
lower order which are formally invariant as to æ and y. Hence, by induction, 
we prove the finiteness theorem for the modular invariants of S’ which are 
formally invariant as to 2 and /. 

In fact, we can so choose a fundamental set of invariants of S’ which are 
formally invariant as to 2 and y that to each invariant of a fundamental set 
of modular invariants of S there correspond at most “ invariants in a funda- 
mental set of invariants of S“ which are formally invariant as to x and y. In 
addition, we have to put L and Q in our fundamental set. We can, if we wish, 
think of Q as a modular invariant of S’ which is formally invariant as to x 
and y, arising from the modular invariant 1. Notice that the degree of 1 is 
congruent to p"(p" — 1) modulo p"— 1. Also, L can be regarded as a 

* It is to be noted that there is not necessarily any such invariant of degree d + p" — 1, 
nor of degree d + 2(p" — 1), ete. For example, if J be unity, the only homogeneous formal 
invariants congruent to J for values of x and y in the field are 1 and powers of Q = Ii. In 
this case there are no invariants J», I. „ I.. 

Trans. Am. Math. Soc. 10 
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modular invariant of S’ which has been made formally invariant as to x and y— 
it is a formal invariant arising from the modular invariant 0. At first there 
appears to be a discrepancy here, since the degree of L is * + 1 = 0 (mod 
p" —1). This discrepancy, however, is only apparent; for, if x and / are 
in the field, L becomes xy — xy = 0. In other words, L reduces to a quadratic 


which ‘‘telescopes,”’ so to speak. 
Hence, applying the fundamental theorem quoted in the introduction, we 


have proved 

TueoreM I. The set of all modular covariants of a system S of binary forms 
has the finiteness property—i.e., there is a finite number of covariants of the set 
such that every covariant of S is expressible as a polynomial in the covariants of 
the subset. 

3. Finiteness theorem for modular invariants of a system of forms and 
cogredient points. ‘The above theorem may, for convenience, be restated thus: 
Let K be the class of all modular concomitants of the system > which are 
formally invariant as to certain sets of coefficients and variables, but not 
formally invariant as to x and y; and let K“ be the class of all modular con- 
comitants of the same system I which are formally invariant as to x and / 
in addition to being formally invariant as to those sets of coefficients and 
variables with respect to which A is formally invariant. Here, as in $1, 4 
and / may be the variables of the system Tor a pair of variables which is 
cogredient with the variables of the system or even a pair of variables which is 
cogredient with the variables aside from a power of the determinant of the 
transformation. Then, if the set A has the finiteness property, the set K’ 
has the finiteness property. 

Now consider the set of all modular invariants of a system of forms S and 
the cogredient points (21, /), (2, y2), (ae, ye). In the body of the 
proof of the fundamental theorem of the Chicago paper,* it was proved that 
the set of all invariants of the system S and the cogredient points is identical 
with the set of all invariants of the system S enlarged by the linear forms 
— £141, 2 — +++, Me te — Ex ye where it is understood that 
now the £’s and 7’s are the variables and the 2’s and y’s are coefficients which 
are independent variables. That is, the invariants of S and the cogredient 
points are the invariants of the enlarged system S’ which are formally invariant 
as to the 2’s and y’s. 

‘Let S’ be the system N of the beginning of this section, and apply Theorem I 
as reworded above, making the set of modular invariants of S’ formally invari- 
ant as to the pairs (2;, y;) one ata time. By induction, we thus prove 

Tukokku II. The set of all modular invariants of a system S of binary forms 


*These Transactions, vol. 21 (1920), pp. 251-252 and p. 254. 
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and the cogredient points (a1, y1), (2, Y2), +++, (A, yx) has the finiteness 
property. 

This is the theorem of Professor F. B. Wiley’s Chicago dissertation;* but the 
present proof has the advantage of showing the relation between the modular 
invariants of S’ on the one hand and the invariants of the original system S 
on the other hand. It also shows the relation between the invariants of S 
and ik — 1 cogredient points, and the invariants of S and k cogredient points. 

This theorem includes as a special case the finiteness theorem for the modular 
invariants of any number m of cogredient (binary) points. For m = 1, 
Dieksonf has already shown that a fundamental set of invariants consists of 
L and . This affords a simple illustration of the fundamental theorem 
quoted in §1. For the invariants of the point (x,y) are the same as the 
formal invariants of the linear form 1 = nx — gy, where E and 7 are the vari- 
ables, and hence (by the theorem) are simply the modular invariants of / which 
have been made formally invariant as to z and /. Now there are two classes 
of linear forms /—(i) when x = y = , (ii) when x and / are in the field but 
(x,y) * (0,0). Accordingly we may take as a set of modular invariants 
which characterize the classes for J: (i) a function which is = 0 whenever x 
and y are in the field, (ii) a function which is = 1 when z and y are in the 
field, but (2, /) + (0,0), and = 0 when (2, /) = (0,0). By Dickson’s 
fundamental memoirf all modular invariants are linear combinations of these 
two functions. Now L is a formal invariant which satisfies condition (i), and 
is a formal invariant which satisfies condition (ii). 

If m = 2, a fundamental set of invariants§ for the Galois field GF [ p] is 


L; = 1 
Q L. r. yi | 


M = — y2%1, MI = a yi — 9½ Ms = — 


s+1 [p—s—1 —1)* 
N. A ) (l=s=p-2). 


M? 


The invariants Q; and X, are all integral functions of 21, 22, y1, ye. This 
gives another simple illustration of the theorem of my Chicago paper. For 
the invariants of the two points (21, /1) and (22, y2) are identical with the 
formal invariants of two linear forms Ii = m1 a1 — E and ly = no a2 — £ ye 
(where the variables are the £’s and y’s), which in turn are the modular 

*These Transactions, vol. 15 (1914), pp. 431-438. 

Madison Colloquium Lectures, p. 38; these Transactions, vol. 12 (1911), p. 1; 
Quarterly Journal of Mathematics, 1911, p. 158. 

t These Transactions, vol. 10 (1909), p. 126. 

§ W. C. Krathwohl, Modular invariants of two pairs of cogredient variables (Chicago disserta- 
tion), American Journal of Mathematics, vol. 36 (1914), pp. 449-460. 
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invariants of Ji and d which have been made formally invariant as to the 
's and y’s. Now the classes of the pairs of linear forms are 


(1) (1,41) = (#2, y2) = (0,0), 

(2) (i, %½1) = (0,0), (2, ye) in the field but + (0,0), 

(3) (ae, y2) = (0,0), (a1, y:) in the field but + (0, 0), 

(4), (41,41) = (a,b), (a2, yo) = (Aa, XD), X + O and (a, b) + (0,0), 


(5), (i, %) = (a,b), (a2, ½) = (e, d), where a, b, e, d are in the 
field, but A = ad — be + O. 


Now for the different classes, the invariants above take the following values 
le 0. M, II, M. N, 


Class 1 0 
Class 2 0 
Class 3 0 
Class 4, 0 
Class 5, 0 


Thus the theorem is verified for this special case. 


$$ 4-9. APPLICATION TO FORMAL COVARIANTS 


4. Alemma. In this section we will prove an important lemma which 
shows the intimate relation between modular covariants and formal covariants. 
This lemma is not new, but is a special case of Miss Sanderson’s theorem.* 
The present proof is given because it is elementary in nature and. because it 
furnishes a simple formula for a formal covariant which is congruent to a given 
modular covariant C whenever the coefficients are in the field. 

Let d(a,b,¢, +--+; «,y) = C be a modular covariant of the system S 
under the group @ of linear transformations with coefficients in the field 
GF[p"|. Moreover, let & be of index w. There is no loss of generality in 
assuming that & is pseudo-homogeneousf in the k coefficients a, b, %.. of 
degree d. For, if @ is not pseudo-homogeneous in the coefficients, it is the 
sum of a finite number of modular covariants which are pseudo-homogeneous 


in zhe u, h, e. 
First, we construct a function K which is homogeneous in the k independent 


*These Transactions, vol. 14 (1913), pp. 489-500. 

+ A function f is said to be pseudo-homogeneous of degree d if, when the arguments a, b, 
. «++ are multiplied by p, any non-zero mark of the field GF [ p"], the function F is multi- 
plied by p*. If f is a polynomial, this means that the degrees of the different terms of J differ 
at most by integral multiples of p" — 1. 


0 0 0 0 
0 1 0 0 
1 0 0 0 
l 0 
l I A 0 
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variables a, ), e,. and such that K = C whenever a, b, e, are in the 


field. We can take 


a, ö, e, 
( U 


K G (do, bo, co,; 4, %) 
do, Vo, Co, *** 
90 ) 


ao, bo, co, . 


Here 2 denotes the sum of ' all terms of the type indicated as the /-tuple 
(ao, bo, co, .) ranges over the k-tuples of a certain set o defined below. 


Inside the bracket, 
a, „ 2 
ao bo Co; 


a b 
a” be 


stands for 


aP (k—2) n h p(k-2) n 
ar 
a b 

be" 


a p(k—-2)n n 


ag bo 


904. bo, C0; 


ao, bo, co, 


and 


stands for the value of Q when a = ao, b = bo, ¢ = eo, «++. Miss Sanderson 
has already shown that Q = 0 if and only if a, b, ¢, . are not proportional 
to do, bo, co, .. The numerator of Q is the product of all essentially 
distinct linear functions of a, b, ¢, --+ in which the coefficients are marks of 
the field, and the denominator of Q is the product of all those essentially 
distinct linear functions of a, b,c, ... (in which the coefficients are marks of 
the field) which vanish whenever the a, b, , .. are proportional to ao, bo, 
a, b, e, 


is a polynomial of degree in 
do, bo, co, . 


co, . Notice that 20 
the coefficients a,b,c, , and thus K is homogeneous in the a, U, % . of 
degree d(p“-?”) which is congruent to d modulo p" — 1. f 

Into the set we put k-tuples (a;, bi, ¢;, )—where the a;, b;, ¢;, are 
in the field such that, if any particular Htuple (ar, bi, ¢;, ) is in the set o, 

These Transactions, vol. 14 (1913), p. 491. 

These Transactions, vol. 14 (1913), pp. 492, 493. 


* 
| 
c? eee 
cp eee 
er 
9 ee 
cP" eee 
＋ eee 
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then (pa;, ohe, per, .) is not in the set c when p is any non-zero mark of 
the field; and such, moreover, that if (a, 8, y, .. is any k-tuple of marks 
of the field, then there is in the set c some k-tuple (a;, b;, ¢;, «++ ) such that 
a = pa;, G = ohe, y = pe;, for p some non-zero mark of the field. 

If we now give toa, b,c, --- any set of values in the field, this set of values 
is of the form par, phy, where (d, bi, ) is a k-tuple of the 


seta. Then K has the value 
Q a, 01, ei, d 
ao; bo, „ 


9 bo, Se, 
2 


ao, bo, 


(K-) u 


= G (ai, bi, a, 
= (pm, ph, 5, 


since C is pseudo-homogeneous in a, , , of degree d. 

If we now subject the variables « and y to any transformation of determinant 
A of the group , the indeterminates a, b,c, . are subjected to an induced 
transformation which carries any particular /-tuple of the set c into a k-tuple 
of the form (odr, pb, where (di, br, is a k-tuple of the 
set c and p is some mark of the field. Let the indeterminates a, b, e, . go 
into a’, b’, ce’, «++ and let do, bo, co, «++ go into pa), pb, peo, . where 
(ah, bs, ch. +++) isa k-tuple of the set c. Since C is a modular covariant of 
index w which is pseudo-homogeneous in a, b,c, of degree d, 


Ar (uo, bo, co, „ /) o (ah, bo, ch, 


* 


At the same time, since (a, b, e,. .), (%, b®",c?", ..), ete. are cogredient, 


a, b, e. 1 wee 
( ’ ’ = —¢ 
do, bo, W. (k-1) n bi, 
( = p? 
1 d 
, , , 7 1 


Av K (ay, bi, cb, 4, 17 — 
| ( 


and 


Thus 


— 


c’, oes 
, , 
G (ao, bo, coe 


— 
1 
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Notice that in this congruence the a, ö, e, --- and the a’, b’, c’, . are two 
sets of indeterminates. Thus the lemma is proved. 

5. A theorem on formal covariants. Let C be a modular covariant of the 
system S of forms with coefficients a,b,c, ---. As above, there is no loss 
of generality in assuming that is pseudo-homogeneous of degree d in the coef- 
ficients as well as homogeneous in the variables x and y. By the preceding 
section, we know that there is at least one homogeneous formal covariant A of 
the system S which is congruent to C whenever the coefficients a,b,c, --+ are 
marks of the field. Let Ao be one such covariant of lowest degree w in a, b, 
c, +++. If there is a second covariant K which is of degree w ina, b,c, --- 
and which is congruent to C whenevera,b,c, --- are in the field, then A — Ko 
= Kk, is a homogeneous formal covariant of S which is congruent to zero 
whenever the coefficients are in the field—that is, A, vanishes for all classes 
of forms of the system S. There are two possibilities: (1) A; is the product 
of two or more rational formal covariants of which none vanishes for all sets 
of coefficients in the field; (2) A, contains as a factor a formal covariant 
which can not be expressed as the product of several covariants and which 
does vanish for all classes of S. 

Thus any formal covariant K of the system S is expressible in one of the 
two following forms: 


(1) K = RO ＋ MI M M,, 


where Mi, Ms, ---, M; are formal covariants of S which do not vanish for 
all sets of values of the coefficients in the field; 

(2) RK RO ＋ 

in which V and Rz are formal covariants of S where vanishes whenever the 
a, h, e, . are in the field and is not the product of two or more formal 
covariants of S. Notice, moreover, that we can use the same formal covariant 
No for all formal covariants which are of the same degree in a, %, e, and 
of the same order in x and y and which, moreover, are congruent to the same 
modular covariant C when a, b, e,. are in the field. 

Now let = denote a set of formal covariants of S determined in the following 
manner. Consider any particular modular covariant C of S (in which all 
the exponents of a, h, , .. are = p" — 1) and the totality of all formal 
covariants K which are congruent to C whenever the coefficients are marks of 
the field. The degrees of these covariants & will be of the form w + g(p" — 1) 
where q is a positive integer and w is the least such degree. For each such 
Do this for every 


degree, choose one formal covariant & to put in the set >. 


*It must be borne in mind that there are not necessarily any formal covariants for each 
such degree. Compare the third footnote in § 2. 
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modular covariant C of a fundamental set of modular covariants of S, and 
let © denote the set of all such formal covariants. 

Then, proceeding by induction, we see that every formal covariant of S is a 
polynomial in the covariants of the set ~ and in those irreducible formal 
covariants of S which vanish whenever the coefficients a, h, %.. are marks 
of the field. Thus we have proved 

TuroreM III. Hevery formal modular covariant of a system & of binary forms 
with respect to the Galois Field GF | p" | is a polynomial in the modular covariants 
which have been made formally ineariant as to the coefficients of the forms, and 
in the irreducible covariants which are congruent to zero whenever the coefficients 
are marks of the field. 

It will now be interesting to give a more elegant proof of this theorem by 
the aid of a symbolic notation. 

6. Symbolic. Notation. In the symbolic theory of algebraic invariants, 
Aronhold, Clebsch and Gordan* express any binary form f of order m as the 
mth power of a symbolic linear function of the variables thus, 


7 ＋ ay Yo (ar ＋ “* = ar. 


Then do = aj’, di = M ae, and in general a, „(a- a. In order 
that the a’s may be independent variables, the agreement is made that we 
never use any term of more than the mth degree in the a’s. Accordingly, if 
we wish to express a product of two or more a’s in symbolic form, we have to 
introduce two or more equivalent sets of symbols, say (ar, q), (81, Be), ete. 
Then f = a! = = y" = --+; and thus we can write do de as (mBi'" Be) 
or as G Cπν a), ete. Then every polynomial in these symbols which has 
the invariantive property will be an invariant of f; though, in order that 
such an invariant may be rational and integral in the a’s it must be homo- 
geneous of the mth degree in the a’s, homogeneous of the mth degree in the 
3's, ete. 

In the theory of modular invariants (formal and otherwise), we can not, 
however, use this classical symbolic notation for the binary form f ovir the 
general Galois field GF [ p"|. In the first place, the binomial coefficients which 
naturally arise in this way may be zero in the field. In the second place, we 
know that L, = (a“ c = a a — a, a" is an invariant symbol, since 


at", a" are cogredient with ar, a2. But in case “ + 1 > m, we have no 


right’ to use this symbol, as explained above. Accordingly we must adopt 


some other symbolism. 


*Aronhold, Journal fiir Mathematik, vol. 62 (1863), p. 281; Clebsch, 
Journal fiir Mathematik, vol. 59 (1860), p. 1; Gordan, Mathematische 
Annalen, vol. 2 (1870), p. 227, vol. 5 (1872), p. 595; Clebsch, Theorie der bindren alge- 


braischen Formen (1872); Gordan, Vorlesungen über Invariantentheoric (1887). 


1921] FINITENESS THEOREMS 


With this in mind, represent the binary form as 
7 = II (ai 1 Qe 42) = ( ay 71 -+- Qe Xo) (81 + 82 2) —— = IIa. 82 


where it is understood that there are m distinct symbolic factors. Now every 
invariantive function of the variables and the coefficients of f is an invariantive 
function of the symbols ar, a2, 81, G2, Yi, Y,. and r, #2. Conversely, 
an invariantive function C of the symbols is a symbolic covariant of f; and 
if it is rational in the a’s it is actually a covariant. If, however, C is to be 
rational in the a’s it must be such that, if we interchange any two pairs of 
symbols—say (ar, az) and (1, 2) —it is unchanged in form, and every 
term of C must be of the same degree in the a’s that it is in the f’s, ete. If 
these two conditions are satisfied, then conversely C is rational in the a’s. 
For C is then a symmetric function of the pairs (ar, q), (81, 82), «++ and 
hence, by the theory of svmmetric functions, is well-known to be rational 
in a certain finite set of such functions, which are simply the a’s. 

7. Symbolic proof of the theorem of section 5. As in the theory of alge- 
braic invariants, we notice that (ar, a2), (81, 82), , are what we might 
call pseudo-cogredient with (, — 41) that is, (ar, az), (81, G2), «++ are 
cogredient with (, — 21) aside from a power of the modulus. For, if 41 
and are subjected to the non-singular transformation 


(i = ar, + bee, 


la A = ad — be O, 
25 


then, since ai %1 + a2% = aa) + an, 


+ d- al. 


Hence every modular covariant of a system S of forms is a modular invariant 
of certain pairs (ar, az), (81,82), +++, (ae, — 41) which are pseudo- 
cogredient; or every modular covariant of the system S may be regarded as a 
modular invariant of certain cogredient pairs (ar, a), (81,82), --:, 
(a2, — ). The converse is also true, though not every invariant of the 
symbolic pairs is a rational covariant of the system S. As proved at the end 
of § 6, a necessary and sufficient condition that an invariant of the symbolic 
pairs be rational is that it be symmetric in these pairs. A similar remark 
applies to the covariants. 

In fact, by the proof of the fundamental theorem, any formal modular 
covariant C of a single binary form F is of the form MI, where M is a 
modular invariant of the n + 1 pairs which has been made formally invariant 
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and V is a formal invariant of the n + 1 pairs which vanishes whenever the 
a’s, B’s, .. are all in the field. This M can be taken as the same for all 
covariants C which are congruent to the same modular covariant when the 
a’s, GS, . are all in the field. 

It can readily be proved that we can so choose M and V that they are sym- 
metric in the pairs (ar, q), (81, 62), ---. For, by the proof of the funda- 
mental theorem, C = M,+ L. Mie, where M., and Mi are two formal 
modular invariants of the n + 1 pairs. Similarly C = M, + Lg Mig, ete. 
Now Ma and Mis can be so chosen that they are obtained from M. and Mia 
respectively by interchanging the pairs (ar, a) and (Bi, 82). Hence 


C = Mig (I. V. + Ly N, + I. Lp Nes); 


where Vi is obtained from N. by interchanging the pairs (a1, a2) and (81, Bz) 
and where Mis and Nis are symmetric in these two pairs. By induction, we 
see that the statement at the beginning of this paragraph is true. 

With slight changes, this proof holds for a system of binary forms. 

It is to be noted that the symbol L vanishes whenever a; and az are marks 
of the field; similarly with Le, ete. But any formal modular invariant which 
evanesces whenever the a’s, 6's, etc. are all in the field (such as those of the 
type L. N. + Lg Ng + IL. La Nag + +++) does not necessarily vanish when- 
ever the coefficients of the system S are in the field. Hence we have proved 
Theorem III. 

This second proof, besides beauty of form, has the advantage of indicating 
the relation between formal modular covariants of two forms of different 
degrees. Although it has the obvious disadvantage that it does not furnish a 
definite formula by which we can determine the formal modular covariants 
of a system of forms, nevertheless it suggests that some day there may be 
evolved a symbolic theory of formal covariants. 

8. Application to the binary quadratic, modulo 3. Dickson“ has shown 
that a fundamental set of modular invariants of the binary quadratic, 


fe = ao 2, + 20 21 42 + a2 23, 
modulo 3 is 


A = aj — aa, q = (ao + a2) (ai + ao a2 — 1); 
while he has shown? that a fundamental set of formal invariants consists of 
A= di —aa@, J (do + a + a2) (d + 2a, + ae) d, 
B = a, (a, + ao) (ai — ao) (2d + u:) (2a, + a2) (ai + a2), 


T = (do + ae) (20 + 2a, + ae) (2 + a, + a2). 


* These Tr ans actions, vol. 8 (1907), p. 209. 
+ These Transactions, vol. 14 (1913), p. 310. 
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To verify Theorem III for the invariants of f, we compute the values of 
q, 4, J, B and T for the different classes and find that 

Class f: B 
0 
0 
| 0 
| : 0 
2 0 


1 — 


N Co 


— 
~ 


Hence, whenever the a’s are in the field, T= — 9, J = 24°+A, B=0O. 
Dickson? has also shown that a fundamental set of modular covariants 
consists of 
J, A, L, fo, fs = ao rt + ar (aia, + 2122) + a2 =, 
Cy = (a? ay — a3) x? + (do — dz) (di + d d) 1 + (ai — ay a3) 23, 
Co = (ai + A) ai + di (ao + a2) a1 ＋ (a3 + A) az. 


Glenn* later showed that a fundamental set of formal covariants consists of 


eighteen forms: 
A, J, B, T, L, Q, fe, fas fo = d i + 21 ai 42 + d 2, 
Ci, Co, Cy = (a5 + A) + 21 (ao + a2) (Ki + (a + A) 


(az di — df) at — (d — dz) (af + ao dz) (&i + 23) 


+ (ai — a, a3) 22, 


= (a, a; - a}) a} + (ao — dz) (af + ao d) + — a, a3) 22, 
= ay ai ＋ 2a axe + a3 x2, 
bs = + ai (aia, + a3 22, 
92 = (ao di — di) ai + (ao d: — d) + (a, a3 — di dz) 23, 
= (az a} — ai) ai + (a3 a? + 2a) as + 20% at + at az + 2a) a3 
+ aj + 2aj az + ao d) * + (a1 — di 


Of the formal covariants not in the fundamental set of modular covariants, 
7. C. = (AT 1 Ii + Ci, — de = Ci, Ci C2, Go = fo and . =. 
whenever the a’s are in the field. Finally, d: = Ez = 0 whenever the a’s are 
in the field. Thus the theorem is verified. 

*These Transactions, vol. 20 (1919), pp. 154-168. For convenience, we have 
used C. instead of Da, which is legitimate, since D. = C. — f?. 
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The writer has also applied the theorem to the set of fundamental formal 
covariants of the binary cubic, modulo 2, which has been found by Glenn.* 
We leave the details to the reader. 

9. A general application. ‘Theorem III enables us at once to narrow down 
the question of the finiteness of formal covariants of a system S of forms with 
respect to the Galois field GF | p"|. There is no loss of generality to consider 
only homogeneous formal covariants and invariants. 

Let Oi be a homogeneous formal invariant of lowest degree q which is = 1 
whenever the coefficients are in the field, but not all zero. Now there are 
polynomials in the coefficients which are = 1 for all sets of values of coeffi- 
cients in the field not all zero and which are formally invariant. For let 
Ii, lg, „ I, be the characteristic modular invariants for all classes except 
the one in which all forms are identically zero, and let II, I, , I. be formal 
invariants of lowest degree which are congruent to I, [2, «++, I, respectively 
whenever the coefficients are in the field. Let the degrees of II, II, , I. 
be di, dz, +++, d, respectively. Then, if Y be the least common multiple of 
the d's, 


+ + + J.“ (3. = D/d;) 


is a homogeneous formal invariant which is congruent to 1 whenever the coef- 
ficients are marks of the field not all zero. 

Now let C be any modular covariant of the system S of order w, and let ( 
be a formal covariant which is congruent to C whenever the coefficients are 
in the field; moreover, let (i be such a covariant of order w and of lowest 
degree in the coefficients. Also, let (. C2, --+, Cy be formal covariants of 
order w which are congruent to for all sets of the coefficients in the field and 
which are of all possible degrees ranging from en up to but not including ei A. 
Let their degrees be respectively Gi On, „. 

If A; be any formal covariant of order e which is congruent to C whenever 
the coefficients are in the field, but not all zero and which is of degree es, 
then A; is identically equal to C; + (a formal covariant which is congruent to 
zero Whenever the coefficients are in the field). 

Now ©, Qi, «++, Ce Qi are all formal covariants which are congruent to C 
for all sets of coefficients in the field. The degrees of (Qi, ---, Cx Qi are 
City, tq, +++, +4 and range from ¢; + up to but not including 
e + 2y. If there be any formal covariants of order w which are congruent 
to Gand which are of a degree different from ei + 4, ¢2 +9, „ Ce +4 
and yet whose degree lies between ¢; + g and ¢; 2, select one representative 
covariant of each such possible degree. Let these additional covariants be 
denoted by Ii, «++, -4). Since there is only a finite number of integers 

*These Transactions, vol. 19 (1918), pp. 109-118. The invariants had already 
been found by Dickson (Madison Colloquium Lectures, p. 56). 
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between ¢; + and ¢; + 2q, there is only a finite number of such additional 
representative covariants. 

Thus if A’ be any formal covariant of order w which is congruent to C 
whenever the coefficients are in the field and which is of a degree between 
ci +q and ei 2%, then K“ is identically equal to (some C.,) X Qi + (a 
formal covariant which is congruent to zero whenever the coefficients are in 
the field); or A’ is identically equal to (some A.) + (a formal covariant 
which is congruent to zero whenever the coefficients are in the field). 

Proceed similarly with the formal covariants of order o which are congruent 
to C whenever the coefficients are in the field and which are of a degree between 
ci + 2% and ei + 3q¢; between ei + 3q and ei + 4q; ete. 

Thus we have a set of those formal covariants of order w which are con- 
gruent to C whenever the coefficients are in the field which is such that there 
is one and only one such covariant for each such possible degree. Moreover, 
in view of the preceding argument and in view of the fact that there is only a 
finite number of additional representative covariants (such as A;, --+, 47, 
etc.), all the representative covariants of order w which are congruent to Care 
expressible as the product of a power of O by one of a finite number of these 
representative covariants. 

Combining these results, we see that all formal covariants of order w which 
are congruent to C for all sets of the coefficients in the field are expressible in 
terms of Or, a finite number of such covariants, and the formal covariants of 
order w which are congruent to zero for all sets of the coefficients in the field. 

By a similar line of reasoning, we can deduce that all formal covariants of 
degree d which are congruent to C whenever the coefficients are in the field are 
expressible in the form (one of a finite subset of such covariants) X (a power 
of ) + (a formal covariant which is congruent to zero whenever the coef- 
ficients are in the field). 

If we now combine these two results, and make use of the fact* that the 


set of all modular covariants possesses the finiteness property, we then prove 


by induction 

TneoreM IV. The set of all formal covariants of a system S of binary forms 
with respect to the Galois field GF | p" | is of such a nature that every such covariant 
is expressible as a polynomial in , Qi, members of a finite subset of formal 
covariants of S and the irreducible covariants which are congruent to zero whenever 
the coefficients are in the field. 

Thus we reach the conclusion that the set of all formal covariants of a 
system S possesses the finiteness property if and only if the set of all irreducible 
covariants which are congruent to zero for all sets of coefficients in the field 
possesses the finiteness property. 

Mount Hotyoxe 
Sourn Hapiey, Mass. 


* See section 2 of this paper. 
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ON THE CONVERGENCE OF CERTAIN TRIGONOMETRIC 
AND POLYNOMIAL APPROXIMATIONS" 


BY 


DUNHAM JACKSON 


1. Introduction. Let f(x) be a continuous function of period 27. Let 
Tun (A) be the trigonometric sum of order M or lower for which the integral 


21 
f (2) — 7. (4) dx 
0 

has the smallest possible value, when m and n are given. The writer has 
recently proved the existence and uniqueness of 7% () for any positive 
integral value of n and for any value of m = 1, whether integral or not, f and 
has given some indication of the behavior of 7% (), for fixed n, as m becomes 
infinite. 

The first purpose of the present paper is to discuss the convergence of 
Tun () toward the value f (), when m is held fast and n is allowed to become 
infunite. It is found to be a sufficient condition for uniform convergence 
that limo (&) / = 0, where is the modulus of continuity$ of the 
function f (2), that is, the maximum of F (“) -(“) for |2’ - “ S6. 
The proof makes use of Bernstein’s theorem on the derivative of a trigonometric 
sum. For m = 1, the condition as stated would require that F (2) be a con- 
stant, but in this case it is sufficient that f () have a continuous derivative. 

One would be inclined to expect that convergence would be more likely for 
large than for small values of m, and the result obtained is favorable to this 
view, as far as it goes. The theorem is less general, however, even for large 

* Presented to the Society, September 8 and October 30, 1920. 

+ For m > 1, see D. Jackson, On functions of closest approximation, these Transac- 
tions, vol. 22 (1921), pp. 117-128; for m = 1, D. Jackson, Note on a class of polynomials 
of approximation, submitted to these Transactions. These papers will be referred 
to by the letters A and B respectively. In A, the treatment applies to a general class of 
approximating functions, including trigonometric sums as a very special case; in B, the dis- 
cussion relates primarily to polynomials, but attention is called to the fact that the case of 
trigonometric sums is to be dealt with in the same way. 


t Loc. cit. A, §§ 8, 9. 
For the name, ef. de la Vallée Poussin, Legons sur l’approximation des fonctions d'une 


variable réelle, Paris, 1919, pp. 7-8. 
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values of m, than the well-known Lipschitz-Dini condition in the case of 
Fourier’s series, m = 2, and the present results may for this reason be regarded 
as only preliminary. 

The latter part of the paper contains a discussion of the corresponding 
problem in polynomial approximation. The method of treatment is generally 
the same, though the details of the reasoning and the conclusions reached are 
not exactly parallel in the two cases. 

To return to the trigonometric case, it will be supposed throughout the 
discussion that m has a fixed value = 1, given in advance. For the sake of 
simplicity, the subscript m will be omitted, and 7% (a) will be denoted 
simply by T. (). This particular trigonometric sum will be referred to 
briefly as the approximating function of order n. 

2. Reduction of the problem. If 7’, (2) is the approximating function for 
f(x), and t (x) is an arbitrary trigonometric sum of order n or lower, the 
approximating function for f() ＋ % (&) will be T. () ＋ % (r). For the 
error is identically the same in both cases, and the integral of the mth power 
of its absolute value is the same, and if it were possible to diminish the value 
of the integral in one case, it could be diminished in the other case as well.* 

Suppose there exists a trigonometric sum . (2) such that 


(2) Se 
for all values of x, where e is some small positive quantity. Let 
$(x) =f (2) —th(2). 
Then the error of the approximating function for & () is the same as the 
error of the approximating function for f (r), and the problem of determining 
the magnitude of the error in the case of f is reduced to the corresponding 
problem for &. 

3. Relation between maximum and integral. Let e be an arbitrary positive 
quantity; let & (r) be an arbitrary continuous function of period 27, subject 
to the condition that 

(r) Se 
for all values of x; and let f() be an arbitrary trigonometric sum of order 
not exceeding u. Throughout this section, n will be fixed in value, and need 
not be indicated as a subscript. Let u be the maximum of |t(2)|, and let ao 
be a point such that f ( =m. Finally, let it be supposed that u = 4e. 
If u < 4e, this fact in itself will serve the purpose for which the conclusion 
of the present section is to be used in the contrary case. 


* The relation between given function and approximating function is of course not linear 
in general; that is, the approximating function for the sum of two given functions is not 
generally the sum of the approximating functions. 
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A theorem of S. Bernstein,“ the importance of which is only beginning to 
be realized, states that the maximum of the absolute value of the derivative 
of a trigonometric sum of order n can not exceed n times the maximum of the 
absolute value of the original sum itself. That is, in the present case, 


Sun, 
for all values of 2. When 


(1) 


therefore, 
lt (a) — t (20) | — Zu, 


by the mean value theorem, and 
t 
Since, furthermore, & S and e =y/4, 
t - 6 (4) 


throughout the interval (1); of length 1/n. From this it follows that 


lt(x) — d(x) |" dx 2 


4. Proof of convergence. Now let , (r) be the approximating function 
of order n for & (), let un be the maximum of , (), and let 


Yn | T(x) - d(x) * d. 


If , (x) were replaced by zero in the last integral, the value of the integral 
would be less than or equal to 21e“, since |¢| Se. But 7, () has the 
property of giving the integral its minimum value, and so it is certain that 


un S 4e (21) / 


* See, o.g., de la Vallée Poussin, op. cit., pp. 39-42; also S. Bernstein, Sur l’ordre de la 
meilleure approximation des fonctions continues par des polynomes de degré donné, Mémoire 
couronné, Brussels, 1912, p. 20; and de la Vallée Poussin, Comptes Rendus, vol. 166 
(1918), pp. 843-846. In Bernstein’s original statement of the theorem, the factor introduced 
on differentiating was 2n, instead of n. 


42 
= 
— 1 1 m 
= ore’. 
By the preceding section, on the other hand, if u. = 4e, 
1 u, m 
4 
So 
1 u, m 
„ (4) = Zee", 
4 
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But if u, < 4e, the last inequality is satisfied a fortiori, and the relation is 
therefore true generally. To summarize: 

If @ (x) is a continuous function of period 2 which never exceeds € in absolute 
value, 2 (x) the approximating function of order n for d(x) , and un the maximum 


of r. (), then 


where km is a constant depending only on m. 
Let p, be the maximum of r, (x) - ()]; then p, satisfies an inequality 
of the same form as u.. For p, S un + é, and so 


Pn = (km ＋ 1) u E. 


In consequence of § 2, the last assertion has the following further significance: 

If f (x) is a continuous function of period 2x which can be represented by a 
trigonometric sum of order n or lower with an error not exceeding en, if Tu (2) is 
the approximating function of order n for f (x) , corresponding to the exponent m, 
and if pn is the maximum of f() - 7. ()], then 


(2) Pn = (him + 1 un En» 


where km has the same meaning as before. 

Hence 7, (æ) will converge uniformly to the value f(x), if f() can be 
represented by trigonometric sums for successive values of n in such a way 
that 

lim Vn en = 0. 

Let w(6) be the maximum of f (“) — f(2’’)| for “ “ =6. Then 
there will exist representations“ of f () such that e, S c (27 /n), where e is 
independent of n, for all positive integral values of n. If 


lim -o (ö) / V = 0, 


then (21 /n) will approach zero as n becomes infinite, and the condi- 
tion for convergence will be satisfied. 

In the case m = 1, the condition lim o w(6)/5 = 0 would require that 
f(x) be a constant. There will exist representations} of f (x) such that 


lim. %é = 0, 


however, provided that f(x) has a continuous derivative, so that the last-named 
condition is sufficient for convergence in this case. 
If w(6) approaches zero more rapidly than is required by the criterion for 


Cf., e.g., D. Jackson, On the approximate representation of an indefinite integral, etc., these 
Transactions, vol. 14 (1913), pp. 343-364; p. 350. 
Cf., e. g., the paper cited in the preceding footnote, pp. 350-351. 
Trans. Am, Math. Soc. 11 
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convergence, the relation (2), together with such general theorems on trigono- 
metric approximation as are contained in the paper just cited, will give informa- 
tion with regard to the degree of convergence of the present approximating 
functions. 

It may be remarked that the reasoning above, as applied to the case m = 2, 
gives a new proof of the convergence of the Fourier’s series for an extensive 
class of functions, although, in view of its dependence on Bernstein’s theorem 
and on the general theory of approximation by trigonometric sums, the proof 
is not more elementary than others that are well known. 

5. Polynomial case, end-points included. The theorem on the derivative 
of a trigonometric sum, on which the preceding proof is based, has a counter- 
part relating to polynomials. Let , (r) be a polynomial of the nth degree 
or lower, such that P, (r) Su for —-1=2=1, where u is a constant. 
Let x = cos 6; then , (2) is a trigonometric sum in 6, of order u or lower, 
the absolute value of which is = u for all values of 6. By the theorem for 
the trigonometric case, 


l 
(cos @) = sin P, (cos 0) Sun. 
dé 


That is,* 
Vl — P. Sun 


for- 12251. 


By means of a linear transformation on æ, it can be inferred further that i 
P. (r) Su for a S b, and ifa<a’ <b’ <b, then 
(3) P. () S Cun 
for a’ = x = b’, where C depends only on a, b, a’, and b’. 

The immediate application of this result to the problem of convergence is 
not apparent. It is possible to reason further, however, as follows. Let it 
be supposed once more that the interval in question is ( 1,1). The func- 
tion P, (cos 60), being a polynomial of degree n — 1 at most in cos 0, is a 
cosine-sum of order n — 1 at most in 9. Since 


cos sin @ = 3{[sin (k +1)6 — sin (k — 1) 8] 
for all values of Y, the expression 


Q, (6) = sin ., (cos 0) 
has the form 
Q, (0) = Ay sin + sin 20 + + A, sin 10. 


„S. Bernstein, loc. cit., pp. 6-11, proves this theorem directly, and makes it the basis for 
the discussion of the trigonometric theorem; but it is simpler to begin with de la Vallée Pous- 
sin’s proof for the trigonometric case. 


19211 TRIGONOMETRIC AND POLYNOMIAL APPROXIMATIONS 163 


Let the maximum of |Q,(@)| be denoted by ui; the theorem of Bernstein 
for trigonometric sums is applicable again, with the conclusion that 


(%) Sunn 
for all values of 0. As Q, (0) = 0, it follows that 
0, (0) S unn. 
1 1 


Let @ have a value in the interval (— 37,47); then 


sin 6 2 
6| =2\sin 6, 


and 


(4) 


The same result can be obtained for 7/2 = x = 32/2 from the relations 
%%) O, Sunn zl, 
S 2 sin 6. 
The relation (4) therefore holds for all values of 6. But 


| Qn ( 6 ) >’ 

(cos 0) = P. (), 

and, on the other hand, ui = un. Hence it follows that“ 
P. ()] S Qun? 


for — lr S1. If P., (r) S u for a S S b, the conclusion is that 


— 57 
(5) \P.(z)| = 
throughout the same interval. 

Now let F (r) be a given continuous function for a = x = b, let m be a 


given fixed number = 1, and let P, (2) be the polynomial of the nth degree 


b-—a 


or lower for which 


| lf (x) (2) dx 


isa minimum. For reasons analogous to those which appeared in connection 
with the trigonometric problem, if p, (2) is a polynomial of degree n or lower, 
such that 

Al< 

Se 

This result is contained in a theorem of A. Markoff (see S. Bernstein, loc. cit., pp. 11-13), 

which states that P, () = un? throughout the interval. If it could be shown above that 
0, (%% (sin 6)| Su: n, the present method would prove Markoff’s theorem completely. 


| 

0. (600 „ 

77 
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throughout the interval, e being small, the error of the approximating poly- 
nomial P,, (2) for f(a) is the same as the error of the approximating poly- 
nomial for 
=f(r) - 
and the latter is easier to deal with. 
Suppose, then, that ¢ (2) is an arbitrary continuous function such that 
Se 
throughout (a,b). Let , (A) be the approximating polynomial of the nth 
degree for & (r), let u, = [wn (Ao) be the maximum of | z, () in (a,b), 
and let it be assumed for the moment that u, = 4e. 
In consequence of (5), if 
5 — 4 
it will certainly be true that 
(4) — 1, (o) 
1 ( 
(4) — () 
The last relation holds throughout an interval of length at least (b—a)/(Sn?), 
and it is therefore certain that 
7. = (r) — G (r) * dx = 
On the other hand, y, is at any rate not greater than the value that would 


be obtained if , (2) in the integral were replaced by zero, that is, 


= (b—a)e". 


( — 4) 
8M ( 4 ) ( a) 


(6) Mn — de 722 m 


Hence 


The last relation has been obtained on the hypothesis that u, = 46, but it is 
clearly satisfied also if u, < 4e. 

Lat p, be the maximum of r, (27) — & r); then p, = u, + ée, while, 
at the same time, if & () has the form f() - p, (2), px is the maximum of 
the difference between f (.) and its approximating polynomial P, (2). Thus 
the following assertion is justified: 


If f(x) ts continuous for a S Sb, and can be represented throughout the 


interval by a polynomial of degree n or lower with an error not exceeding en, if 


2 
2 
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P,, (x) ts the approximating polynomial of degree n for f (x2), corresponding to 


the exponent , and if p is the maximum of \f (x2) - „ (), then 


m 


Pn = Cm n- Ens 


where c is a constant depending only on M. 

The constant c corresponds to that which was denoted by /,, + 1 in the 
trigonometric case; the difference in notation has no significance. 

The polynomial P, (2) will converge uniformly to the value f(2) for 

x = bh, if it is possible to make e, take on such values that 
lim ‘Vn? - e = 0. 

Let w(5) be the modulus of continuity of f(), as before. There will 
exist polynomial representations“ of F (r) such that e =e’ w[(b — a)/n], 
where c“ is independent of u, for all positive integral values of n. A sufficient 
condition for uniform convergence throughout the interval a= x= b is that 
lims—o w (6) Ve = 0. 

For m = 2, the condition just stated would require that f (2) be a constant, 
but it is easy to replace this condition by a more satisfactory one. If 
1 < m = 2, let it be assumed that f (r) possesses a continuous first derivative 
having the modulus of continuity o (&). It is possible to maket 


5 — 4 
— 
n n — 17 


, once more, is independent of n. For 1 < m S 2, a sufficient con- 


where c“ 


dition for uniform convergence throughout the interval a = x = b is that f(x) 
have a continuous derivative, and that limo oi (6)/6°' = 0, where o (6) 
is the modulus of continuity of the derivative. In case m = 2, the condition 
thus obtained is simply that f (2) have a continuous first derivative. 

For m = 1, a sufficient condition is that f (x) have a continuous second deriva- 
tive.t 

If F (2) possesses a greater degree of regularity than is required for mere 
convergence, theorems on the degree of convergence of the approximating 
polynomials can be written down at once; it is not necessary to dwell on the 
point further. 

6. Polynomial case, end-points excluded. The relation (6) once having 
been obtained, it is possible to make use of (3) to some advantage in connection 
with the convergence at interior points of the interval. 

* See, e.g., D. Jackson, these Transactions, vol. 14, as already cited, pp. 353-354. 

+ D. Jackson, these Transactions, vol. 14, as cited, p. 354. 

t Cf. preceding citation. 
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Let the notation leading up to (6) be kept in force, let a’ and b’ be two 
numbers such that 
a<a<vb’<b, 
and let a. , (1) be the maximum of | 7, () for a’ = x = b’, with the 
assumption, temporarily, that u. = 4e. 
By (6) and (3), 


(&) — (1) S 40e . 8½ 1 = Cle - — il, 


as long as x is in (a“, b’); the constant C’ depends only on , d, 0, a“, and b“. 


If 


(7) E — 11 = 


— — — — 
20 


x will surely be in (a“, )“), on one side of 2, at least, for u = np (m,a,b,a’,b’), 
because the right-hand member of (7), in consequence of (6) and the fact that 
u, S Mn, can not exceed 2. 8¼%% (Cn); and it will follow further that 


1. (&) — (21) 
(r) 
(4) 6 (4) S Tu. 


These relations hold throughout an interval, the length of which is at least 
equal to the right-hand member of (7). 
This means that 


u. u. m 
Yn = 1 (4) - & dæ (4) 


But it is still true that 

Yn = (- a) 
Hence 
< C n@imt 


qm (m+ 2)/[m(m-+-1) 


eon 

the coefficients C“ and ““, like the preceding C’s, depending only on m, a, 
b,a’,andb’. A relation of the same form continues to hold if u, < 4e. 

There is perhaps no need of setting down at length the resulting theorems on 

convergence and degree of convergence in the interior of the original interval. 

The'results of the section are summed up by saying that the exponent 1/m 

of the trigonometric case, which became 2/m in the preceding section, is 


replaced here by 


Tue University or MINNESOTA, 
MINNEAPOLIS, MINN. 


1 m+2 
mm+i- 


DETERMINATION OF ALL GENERAL HOMOGENEOUS POLYNOMIALS 
EXPRESSIBLE AS DETERMINANTS WITH LINEAR ELEMENTS* 
LEONARD EUGENE DICKSON 


1. The general quadratic forms in three and four variables can be trans- 
formed into x; 2 — x} and 2; 22 — 23 * respectively, and hence are expressible 
as determinants of order 2. Since any binary form of degree r is a product 
of r linear forms, it is expressible as an r-rowed determinant whose elements 
outside the main diagonal are all zero. 

It was proved geometrically by H. Schrétert and more simply by L. Cre- 
monat that a sufficiently general cubic surface f = 0 is the locus of the inter- 
sections of corresponding planes of three projective bundles of planes: 


+ Alig + = 0 (i 1, 2, 3), 


where x, X, A are parameters and the /;; are linear homogeneous functions§ of 
„ Hence f= 0 has the determinantal form !/;;| = 0. Taking 
x, = 0, we see that a general cubic curve is expressible in determinantal form. 

I shall prove that every plane curve is expressible in determinantal form and 
that, aside from the cases mentioned above, no further general homogeneous 
polynomial is expressible in determinantal form. 

The case of quartic surfaces was discussed erroneously by qessop. His 
argument would apply equally well to the determinant D whose 16 elements 
are binary linear forms and show that D can be given a form containing: 
single parameter, whereas every binary quartic can be expressed in the form D. 

A new theory of equivalence of pairs of bilinear forms is given in § 9. 

2. THEOREM 1. When the number of terms in the general form of degree r in 
n variables (n > 2) exceeds (n — 2)r?+ 2, it is not expressible as a deter- 
minant whose elements are linear forms. 

* Presented to the Society at Chicago, December 28, 1920. 

Journal fir Mathematik, vol. 62 (1863), p. 265. 

t Ibid., vol. 68 (1868), p. 79. 

§ In case their coefficients are rational we obtain all rational solutions of the Diophantine 
equation f = O by solving our three linear equations for the ratios of x, , 24, obtaining 
cubic functions of x, A, A. 

|| Quartic Surfaces, 1916, p. 160. 
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Let D be any r-rowed determinant whose elements are linear homogeneous 
functions of &, «++, 2. We may express the matrix M of Y in the form 
xv, M,+ --- +2, M,, where each M; is a matrix whose r elements are con- 
stants. Evidently D is at most multiplied by a constant not zero if we inter- 
change any two rows or any two columns, or multiply the elements of any row 
or column by a constant not zero, or add to the elements of any row (or column) 
the products of the elements of any other row (or column) by a constant. 
The effect on M of any succession of such“ elementary transformations ”’ is 
known to be the same as forming the product A.B, where A and B are con- 
stant matrices whose determinants are not zero. 

If the determinant of MI is zero, D lacks i and will not represent the general 
form. Hence Mi has an inverse Mr such that Mi My is the identity matrix I. 


Consider the new matrix 
N = = 22 Nz＋ . . . N. (Ni = M. Mr), 


whose determinant equals the quotient of D by MI and has unity as the 
coefficient of xj. The product IN will likewise have J as the coefficient 
of x, if and only if 4 = . Our next step is therefore to choose matrix B 
so that B N B shall have a canonical form. We first interpret this product. 
If Ne is the matrix of the transformation 


j=l 


and if we introduce new variables 91, , 7, by means of a transformation 


2 Bix Er 

whose matrix is B, transformation (1) becomes a transformation on the 37s 

whose matrix is easily verified“ to be B N B. Naturally we desire that 

as many as possible of the n’s shall be transformed into mere multiples of 

themselves. Hence we seek functions 7 = =8; & such that 7’ = Ay under 

transformation (1); the conditions are evidently 


Bray = N 


Thus d must be a root of the characteristic equation 


X 


. Arr — 


* Dickson, Linear Groups, Teubner, 1901, p. 80. 


| 
| 
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of transformation (1). When X is any root, the above conditions are known 
to have solutions 61. , 8, not all zero, so that n’ = An. 
If in the last determinant we replace x by — x;/.2 and multiply the elements 


of each row by 22, we obtain 


＋ TI 2 Ay 
which is the determinant of matrix x. No ＋ 2, J and hence is the value of D 
when 23 = 0, = 0. The latter must be the general binary form in 
21, 4, since D is required to represent the general form in 1, +++, 2. Hence 
the above characteristic equation must have r distinct roots xi, , M.. 
4 We thus obtain r linear functions i., 7, which (1) multiplies by Xr, 
+++, N, respectively. A simple artiffee“ shows that these 7’s are linearly 
independent functions of E, , E, and hence may be taken as new variables 
to give the desired canonical form 


(2) ni = ini (i = 1. „r). 
Denote its matrix BI Na B by P2. Similarly, denote BN, B by P,. 
Hence our matrix & (and thus ) has been reduced to 

P = xi I + x2 Pa + ---+ 2, Pe 


For n > 2, the further normalization of P is to be accomplished by means 
1 of a matrix K such that K P K = P. But the only linear transformation 
commutative with (2), in which Ri, ., X, are distinct, is seen at once to be 


(3) 11 = kin: (¢ = 1, 9), 


where the k’s need not be distinct, but each is + 0. If 


, * 
ni = 0; (i = 1, 
j=l 


} is the transformation whose matrix is P;, and if K is the matrix of (3), then, 
4 in view of the above interpretation, A~! ; K is the matrix of 
q 

cu (i = 1, „r; ki MN). 
jai 


Thus the maximum simplification possible in P:, ö, P, is to make 1 — 1 
non-vanishing elements take the value unity, so that at most (n — 2) 2 
— (- 1) parameters appear in their canonical forms. Taking account 
also of Mi, , A, and of the factor initially removed from D, we conclude 


* Ibid., p. 222. 


| 

| 

| 

| 

| 

| 
⁰ 


170 LEONARD EUGENE DICKSON [April 


that D can be given a form containing at most (n 2) ＋ 2 parameters. 
A generalization of Theorem 1 is given in § 10. 
3. The number of terms in the general form of degree r in n variables is 
known to be the binomial coefficient 
(r+n-—1)! 


r ri(n—1)! 


This follows by two-fold induction since in 


1 — 1 r r 


the first symbol therefore enumerates the terms with the factor x, and the 
second symbol enumerates the terms lacking 2, . 

4. By §§.2-3, the general form of degree r in u variables (n > 2) is not 
expressible in determinantal form if 


— 1 
(4) 


If r = 2, this condition reduces to (n -- 3)(n—4)>0. If 7 = 3, it is 


n? + 3n? — 52n + 96> 0 


and holds when =5. If r =4 and n =7, we have 


n —1 (n — 1)! 
since we have replaced r by 4 in all but the last two factors. The factors 
n — I, n - 2, 7 (which are absent if n = 7) may be cancelled. We get 


(n+ 3)(n+ 2)(n4+1)n(r+2)(r4+1)/6!, 
which will exceed f = (n — 2)r°+ 2, since (n — 2)(r+2)(r4+1)>f, 


provided 


(n+ 3)(n+ 2)(n+1)n/6!Z2n— 2. 
The latter may be written in the form 
(n — 7) (n*?+ 13n? + 102n) + 144020. 


‘It remains to treat the cases n =6. For n = 3, (J) fails if r = 2, since 
=r+2 for (r—1)(r—2)=20. But for n= 4, (4) holds if 
(r—1)(r—2)(r—3)>0. For n=5 and n= 6, (4) becomes respec- 


tively 


1075 — 3772 + 50r — 24> 0, 
r+ 15r'+ S5 — 2552 + 274r — 120 > 0, 


E 
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each of which evidently holds if r= 3. Hence we have 
THEOREM 2. The general form of degree r in n variables, n 2, is not 
expressible in determinantal form if r= 2 or 3, n> 4, and fr S4, S4, 
and hence unless n = 3, r any, orn = 4,r = 2 or 3. 
5. Since general quadrie and cubic surfaces are expressible in determinantal 
form (§ 1), there remains only the case n = 3. 
; For n = r = 3, we may employ the canonical forms,* omitting those with a 
linear factor (§ 6): 


| ey 2 1 

＋ * + 22 — = 2 X ay, a+-=m-1, 
aly 2 

jz y z x 0 

＋ % — O x yi, ＋ y2=|0 2 

y Of jz O 


I obtained the determinants by inspection. 
For n = 3, r = 4, a general quartic curve can be given the form 


a h ＋ d ＋ ef? — 2abed — 2abef — 2cdef = 0, 


where a and b, e and d, e and f are three pairs of bitangents of a Steiner set. f 
I find that this function equals the determinant 


0 


b 
0 


But the actual determination of the bitangents depends upon the solution 

of an equation of very high order. Also numerous radicals appear in the 

expressions for m and the canonical variables in the above cubic form. The 

5 method next explained is not only general, but employs no irrationals other 

than the roots M, , N. 

i 6. The following method enables us to express the equation f = 0 of any 
plane curve of order r as a determinant of order r whose elements are linear 
functions of x,y,z. It u. -uffice to prove this for irreducible forms f. For, 
if f = fife, where F. is of gree r; and is expressible as a determinant of 
order r; of matrix M., then f equals the determinant of the matrix 


M, O 
0 M,}’ 


where O is a matrix all of whose elements are zero. 


P. Gordan, these Transactions, vol. 1 (1900), p. 402. 
+ Miller, Blichfeldt and Dickson, Finite Groups, 1916, p. 355. 
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We suppose merely that f has no repeated factor. Then there exists a 
straight Jine which cuts the curve in r distinct points.* Take it as the side 
z = 0 of a triangle of reference. Take as the side y = O any line not meeting 


~ 


z = 0 at one of its r intersections with the curve. Then (1,0, 0) is not on 
the curve, and the coefficient of 2* in f may be assumed to be unity. Hence, 


II 


for z = 0, f reduces to a product Ai A X, of r distinct linear functions 
X;=a+X:y. Thus 


k=l 


where F is a binary form of order r — . 
We shall prove that every such form F, in which Mi. , 
can be expressed as a determinant of the type suggested by § 


Niten2 ez: Cir 2 


are distinct, 


15 


(6) 


There are 3 (r+ 2) (FI) coefficients in a general ternary form of order r. 
In (5) we have identified f (, /, 0) with XY, --- A,, thus fixing r+ 1 coef- 
ficients; there remain 4 (r? + r) coefficients. Hence the identification of (6) 
with (5) involves as many conditions as there are ci in and below the main 
Accordingly we shall assign simple values to the remaining ¢;;: 


diagonal. 
(7) 
This choice is in accord with the general theory in § 2, where it was shown 
that, without altering determinant (6), we may assign the value unity to 


Cin = 1, G = 1. r). 


r 1 non-vanishing c’s. 

We proceed to prove that the ch (j =i) can be uniquely determined so 
that determinant (6), subject to (7), becomes identical with any given form (5). 
Use is made of the known expansion of an “ axial” determinant (6). First, 


the terms linear in z are 


Len 211 Xe X. 


i=l 


This sum will be identical with Fi (/, x), where Fi is any given binary form 


For, if every line z = rx + sy cuts f = 0 in points two of which coincide, 


F(z,y) =f(z,y,rz + sy) =0 


has a double root for every r, s, whence 
OF af aF _ af 


or (Ox 92 9% ay 52 


Thus every point on f = 0 is a singular (multiple) point. 


for every r,s. 


* 
...... 
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of order r — 1, if they are equal for the r values x = — A;y (i= 1, - 
for which the A, vanish. The resulting conditions 


en (M1 — M.) Ni) N.) = FL, — X-) 


uniquely determine ch (i= „r). 


Next, the terms of (6) quadratie 
in 2 are 


Ci Cij * r 8 
f 8 TT (i, 1, 1 <9). 
3 ji Ci 
. This sum is to be identified with 2? F (/, T). If >i+ 1, then ci = 0 and 
5 the two-rowed determinant equals the previously determined number ¢;; ¢;;. 
é If 7 = 7+ 1, then ¢;; = 1 and the diagonal term is known. Transposing all 
3 J j 8 8 


the known terms and combining them with 22 Fa, we are to identify 


r—l 


Ci+1, i AI Ai X i+2 XN, 


i=1 


with a known binary form of order r - 2. It suffices* for this purpose to 
take in turn A1 = 0, A2 0, ---, X¥,1=0. Of the resulting conditions, 
the first determines car, the second determines ¢32 in terms of ¢21, ..., the last 


determines ,, i in terms of ¢,1,;-2, so that they uniquely determine all 
of the 

To make the general step of the proof by induction, we assume that the 
1 terms of (6) of degrees 1, 2, „% — 1 in have been identified with the terms 
; of (5) of corresponding degrees by the unique determination of the c, ¢is1,;, 
„chi. The terms of (6) of degree k in z are 


ci Ciyig , 

11 212 21 ‘ 

tke 17 ix 


This sum is to be identified with z* F. (/, ). If in the determinant just 
written every element just above the main diagonal is unity, then 


i: 1, = iz , it = 


2, 


and the difference between the subscripts of any c other than ¢;,;, is 
so that the e is one of those previously determined, while the minor of the 
exceptional e equals unity. We shall prove that the expansions of the remain- 
ing determinants D involve only previously determined c’s. Then after trans- 
posing known terms and combining them with z* F, we have left only the 
product of 2* by 


17 A1 eee 


i+k-l,i ** — 
i=l 4. X 1411 4 i+k—1 


Or we may determine c., i from X, = O and work back half way or all the way. 


4a 
| 
| 
| 
| 
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which is to be identified with a given binary form of order r— . The con- 
ditions obtained by taking in turn XY, = 0, ---, Ai = 0 determine these 
e’sin turn. Hence the induction is complete. 

It remains to prove the statement regarding any determinant D which has 
at least one zero element e just above the main diagonal. Then, by (7), zero 
is the value of every element of D which lies in the rectangle bounded by e and 
the elements above it and the elements to the right of it. Hence D is the 
product of two principal minors (each having its diagonal elements on the 
main diagonal of ). In case either minor has a zero element just above its 
diagonal, it decomposes similarly into a product of principal minors. Hence 
D is a product of two or more principal minors each of which has either a single 
element c or is a principal minor P all of whose t elements just above the 
diagonal are equal to unity. If the diagonal elements of such a (¢ + 1 )-rowed 
P are 

en (i i, n, u, „ 
then, by (7), 


in = 2,+1, in = im $1 1112, * 


so that the maximum difference of subscripts in any element of P ist, — 7; = t. 
Since P has fewer rows than D, I=, andt=k—2. Hence by the 
hypothesis for the induction, every element of P is among those previously 
determined. 

THEOREM 3. Every plane curve can be represented by equating to zero a 
determinant whose elements are linear functions. In particular, any ternary 
form without a repeated factor can be transformed linearly into mf, where m is a 
constant and f(x,y,0) = Ai. A,, Yi „ X, being dis- 
tinct. Then f can be expressed in one and but one way as a determinant 


XM 0 * 0 
C21 2 No Coo 2 coe 0 


. . . . . . 


Cr1 2 62 2 Cr3 2 A. + Crr 2 


in which the elements just above the main diagonal equal z and the remaining 
elements above the diagonal equal zero. 

7. The problem for quadratic forms may be treated with attention to 
rgtionality. A determinant of order 2 or 3, whose elements are linear forms 
in &, /, 2, w with coefficients in a given field F (or domain of rationality) 
evidently vanishes for a set of values, not all zero, in F. Hence a quadrie 
surface which is representable as such a determinant must have a point with 
coérdinates in F, which may be taken to be (1, 0, 0, 0). If the surface is 
not a cone, the coefficient of x may be taken as the new variable y. Then by 


* 
3 
a 
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adding to æ a suitable linear function of /, z, w, we obtain zy + Q(z,w), 
where @ is a binary quadratic form with coefficients in F. A determinant 
representing it can evidently be given the form 


r+A B 
** 

where A is free of x, while B and C are free of x andy. Thus A = 0, and the 
condition is that Q have linear factors with coefficients in Y, so that the 
tangent plane y = 0 at (1, 0, 0, 0) cuts the surface in rational lines. For a 
cone or conic, we note that a ternary quadratic form which lacks x? can be 
transformed rationally* into zy + az* or 0 (/, 2). But every binary form 
can be expressed rationally in determinantal form: 

* ＋ 

| — ay 


a3 
— 
do + y+ +a,y" = — 44 7 


(= 1) 
0 


TuHeorEM 4. Every binary form can be expressed rationally in determinantal 
form. A rational quadrie surface not a cone can be expressed rationally in deter- 
minantal form if and only if it has a rational point the tangent plane at which 
cuts the surface in rational lines. A conic (or quadric cone) can be represented 
rationally in determinantal form if and only if it has a rational point (not the 
vertex). 

If we desire to ignore irrationalities, we take F to be the field of all complex 
numbers. A summary of our results gives 

Tukonku 5. Every binary form, every ternary form, every quaternary quad- 
ratic form, and a sufficiently general quaternary cubic form can be expressed in 
determinantal form. No further general form has this property. 

I have treated elsewhere} the problem of quaternary cubic forms with 
attention to rationality. The number 20 of coefficients equals the number of 
disposable parameters in the determinant (§ 2), and the problem depends in 
general upon the solution of a single algebraic equation. The notations sug- 
gested by § 2 are less convenient for this problem than those used in the 
paper cited. 

8. We shall examine briefly the conditions under which a given rational 
ternary cubic form 7 is expressible rationally in determinant form. If 7 

* Dickson, Algebraic Invariants, 1914, p. 24. 

American Journal of Mathematics, April, 1921. 
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vanishes at a rational point, the problem has been treated fully in the paper 
last cited. In the contrary case, we may assume that the coefficient of 2° is 
unity and that the terms in 2” y and 2 z are lacking. The matrix of the deter- 
minant may be taken to be a2, I + Ne +2N3, where 21 = , = y. 
Since 7' is not zero at a rational point, I + 2 N + O when 2 and 2 are 
rational and not both zero. As shown in § 2, the characteristic equation of N: 
therefore has no rational root, and hence has a single invariant factor 
N — adv — B, so that there exists ($9) a matrix of rational coefficients which 
transforms Ni into P: and N; into* P3: 


0 a B fa 
72 1 0 0), P; = 0 f 
0 1 O/ ghk 
The determinant of «I + yP2 + 2P3 will be identical with 
T = 2° — ary’ + By? + Bayz + Cy? z + Daz + Ey? + Fs 
if and only if 
ate+k=0O, b+ad+6g4+f= —B, c—ak+6d+ph=C, 


ae e f 
g k hk 


b e la c de 


h af I: la +8 


| 0 


g 


Employ the linear equations to express a, h, e in terms of the remaining letters. 

In ) ,gandf enter linearly, the latter with the coefficientd - . Ifh = d, the 

result of substituting the values of g and f given by our equations (D) and 

(L) into , = F is of the ninth degree in each d and e and of the eighth 

degree in . Since this special case thus involves hopeless difficulties, we 

take M =d+m,m+Q0. Then equations (D), (L) and (J) give 
1 y = mB - ak + 38d — C, 


m 
6=D+e+ek+i? — Bd — ad’, 
+ (e+ 2k) + mB(k -e) — ablg — me — 6(e + 2k) = 0, 
ye= L= Bk — C(m+ 2d) - 2adk — amk + 38d? + 38md + Bm’, 


For the normalization of P; we have available the most general matrix 


0 6 0 
10 — 


commutative with P:. But it transforms P; into such a complicated matrix that normalization 
does not seem worth while. 
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—-e-k C— mB - 28d 
md me gy — 6 
g me + md k 


The resultant of our quadratic and cubic equations in g is readily seen to be 
of the sixth degree in , with coefficients involving d, e, / to higher degree. 
Even in the simple case in which T has a rational point, the problem depends 
upon a cubic irrationality. 

9. Two pairs of bilinear forms with matrices M, N and M’, X' with ele- 
ments in a field F and such that N and N’ are non-singular (determinant + 0) 
are called equivalent with respect to F if there exist non-singular matrices P 
and with elements in F such that PMQ = M’, PNQ = N’. Necessary and 
sufficient conditions for equivalence are known to be the identity of the 
invariant factors of M MN with those of M' —XAN’. The special case 
N = N’ =T1 shows that matrix (or substitution) M can be transformed 
within F into Q-' MQ = M’ if and only if the invariant factors of the char- 
acteristic determinants |.M — J) and | M’ — MI of M and M are the same. 
This case finds application in many branches of mathematics. 

The last result has been proved independently* of the theory of pairs of 
bilinear forms. From it we can deduce the latter theory. First, let M, NV 
be equivalent to M’, N’, so that MN = J, I are equivalent to M’ N 
= J’, 1, whence AJB = J’, I, A= B". Thus B transforms J 


into J’ and their characteristic determinants |.MN-! = and |.M’ 
— N have the same invariant factors. The latter are not altered when the 
determinants are multiplied by the constants X and | N’|, respectively. 
lence M= XN and —XN’| have the same invariant factors. This 
necessary condition is also a sufficient condition for equivalence of the pairs. 
For, we then have Bo JB = J’, J= MN“, J’= M’' N"", where the ele- 
ments of B are in F. Then, if X is arbitrary, 


Bo — MT) B = M/ VII- XT, 
Bo (M —\N)N—™ B (M/ —-XN’)N", 
Fi (H- XN) C= MH - XN’, C = MI 


Thus B serves as pre-factor and C as post-factor to convert M into M’ and 
N into N’, so that the pairs are equivalent. 

The ideas in this paper evidently apply also to the question of the equivalence 
of n-tuples of bilinear forms, and are being developed by one of my students. 

10. A generalization of Theorem 1 is given by 

THEOREM 6. Any r-rowed determinant D whose elements are linear homo- 

* Dickson, these Transactions, vol. 3 (1902), pp. 290-2. 

Trans. Am. Math. Soc. 12 
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geneous functions of 21, «++, , can be expressed in a canonical form involving 
not more than (n — 2) + 2 parameters when n > 2. 

The matrix of D is of the form M = a, M, + --- +a, Mn. 

Since we may assume that D is not identically zero, there exist constants 
(1, „ % such that the determinant of C = ei Hi + --- +e, M, is not 
zero. Replacing 21, „ , by 


ei + di tees +++, ＋ d. te 1. „, 


where the d;, --- , V are chosen so that the determinant of these linear forms 
is not zero, we see that M is replaced by æ&10 ... Returning to the 
initial notations, we may therefore assume that |./,| + 0. 

Proceeding as in § 2, we reduce M to an equivalent matrix 


Pa, 


in which P: is the canonical form of a matrix (or linear substitution) obtained 
by transformation. We proved our theorem in § 2 for the case in which Pa 
is of the special form (2) with Mi, , A, distinct. In case equalities occur 
among these \’s, transformations (3) are included among those commutative 
with Pz, so that we can accomplish the same (and further) specialization of 
the parameters in P;, ., P, as in §2. Hence for two reasons the number of 
parameters in P:, „ P, is smaller than before. 
Finally, let the canonical be of its most general form :* 


(8) ni =Am+m;, nz =i h + = NI + Nes 
8 


(9) 
ete. Now (S) is commutative witht 
(10) ni hey ni ＋ ke 741 + + he Ne, 1 „ ei). 
To verify this fact, we may take xi = 0 in (8), since we may subtract A, J 
from the matrix of (8); then the product of the modified (8) and (10), in 
either order, replaces . by 

hey H + he nize + + hei te, - 


Similarly, (9) is commutative with 


Dickson, Linear Groups, p. 223; Bécher, Higher Algebra, p. 293. 

t We do not need the fact that every linear transformation on m1, «++, ne Which is com- 
mutative with (S) is of the form (10). If the x's are all distinct, every transformation commuta- 
tive with Pu is given by (10), (11), ete. 


— 
ͤoP⁰Oÿd 
222 (121, +++, es). 
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Transformations (10), (11), ete. are available for the normalization of 
P, (J >2). Let P, replace 7, by 


(12) tim T +h, me, +f; 


where F is a linear function of the ¢’s, ete. If K is the matrix of the trans- 
formation defined by (10), (11), ete., K 1 P, K replaces % by the product of 
er! by the function by which (10), (11), ete., replace (12), the product being 


1 
f'+ 2 t. (Li ni + H+ ne, ) 


ko ko k; 


ko f’ 
+ ( + + 4400 Ne; 


where f’ is the function by which (11), ete. replace f. Hence the ratios 
2 ‘ky, hy, Ei, 12/ II, /li, 


are available to specialize as many parameters in P, Their number is 
ei Te e. 1 if ei +--- +e,=r. In §2 we could specialize 
only this number r — 1 of parameters in the P, (j > 2). We now have only ¢ 
distinct \’s instead of the former r. Hence the total number of parameters 
found in § 2 is the true maximum. 
THe UNIVERSITY OF CHICAGO, 
Cuicaco, ILL. 


PSEUDO-CANONICAL FORMS AND INVARIANTS OF SYSTEMS OF 
PARTIAL DIFFERENTIAL EQUATIONS* 


BY 


ALFRED L. NELSON 


1. INTRODUCTION 


The theory of invariants of the linear ordinary differential equation dates 
from 1862, when Sir James Cocklef obtained certain seminvariants. In later 
papers he made progress in the theory, while other writers, f notably Laguerre, 
Brioschi, Halphen and Forsyth, extended his results. The theory thus 
developed presents a striking analogy to the theory of algebraic invariants. 
One point of similarity, with which this paper has much to do, is the fact that 
by means of a transformation of the type y = X() 75, the linear differential 
equation 


% ＋ pny? + oy = 0 


may be made to assume a canonical form, in which the coefficient of y“~” is 
zero. The remaining coefficients are then unchanged when the equation is 
subjected to any transformation of the type in question, i.e., they are absolute 
seminvariants. The seminvariants thus obtained are moreover a fundamental 
set, in the sense that any seminvariant whatever is a function of them and of 
their derivatives. 

The work of Wilezynski$ has shown how the previous theory may be ex- 
tended to include completely integrable systems of linear homogeneous partial 


* Presented to the Society, September 2, 1919. 

+ The Correlations of Analysis, Philosophical Magazine, ser. 4, vol. 24, pp. 
531-534. 

t In the first pages of an article ‘ Invariants of the General Linear Differential Equation and 
60 ir Relation to the Theory of Continuous Groups, American Journal of Mat he— 
matics, vol. 21 (1899), pp. 25-84, Bouton gives a brief sketch of these contributions. 

§ Projective Differential Geometry of Curved Surfaces, these Transactions, vol. 8 
(1907), pp. 233-260. 

One-Parameter Families and Nets of Plane Curves, these Transactions, vol. 12 
(1911), pp. 473-510. 

Sur la Théorie Générale des Congruences, Mémoires de l’Académie Royale 
de Belgique, Classe des Sciences, ser. 2, vol. 3 (1910-1912). 
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differential equations. In various papers, he and others“ have obtained 
canonical forms of certain such systems, making use of the transformation 
y =X(u,v)y. As in the theories referred to, the coefficients of a canonical 
form are a fundamental set of seminvariants. Greent has proved that for a 
very large class of completely integrable systems, such a canonical form can 
always be obtained, and that its coefficients will have the usual proper- 
ties. Finally should be mentioned a paper of Wilezynski,t in which is given 
a general proof covering all known theories of invariants, as well as many 
others. 

It is characteristic of all the work just referred to that the actual existence 
of a canonical form has been considered necessary to the proof that the coef- 
ficients of such a form are seminvariants. This has resulted, in so far as partial 
differential equations are concerned, in a failure to obtain canonical forms 
characterized by the vanishing of certain coefficients. Instead, certain func- 
tions of the coefficients have been made to vanish. The seminvariants secured, 
therefore, have not been as simple as might be desired. The author || has 
shown that, for a large class of completely integrable systems, a fundamental 
set of simpler seminvariants can be constructed as the coefficients of a pseudo- 
canonical form. These non-existent pseudo-canonical forms are characterized 
by the vanishing of certain coefficients. The proof used seemed more compli- 
vated than necessary, and one of the aims of the present paper is to furnish a 
simpler one. 

A second purpose is the extension of the theory presented in the previous 
article to include semi-covariants, invariants and covariants. 

Attention will be confined to partial differential equations with a single 
dependent variable, although the facts exhibited will hold for systems having 
two or more dependent variables, provided the transformations used are of 
the type discussed in this paper. It must be admitted, however, that such 


W. W. Denton, Projective Differential Geometry of Developable Surfaces, these Trans- 
actions, vol. 14 (1913), pp. 175-208. 

G. M. Green, Projective Differential Geometry of Triple Systems of Surfaces, Columbia 
Dissertation, 1913. 

G. M. Green, Projective Differential Geometry of One-Parameter Families of Space Curves, 
and Conjugate Systems of Curves on a Surface, American Journal of Mathe- 
matics, vol. 37 (1915), pp. 215-246. 

Linear Dependence of Functions of Several Variables, and Completely Integrable Systems 
of Homogeneous Linear Partial Differential Equations, these Transactions, vol. 17 
(1916), pp. 483-516. 

t Invariants and Canonical Forms, Proceedings of the National Academy 
of Sciences, vol. 4 (1918), pp. 300-305. 

The one exception to this statement is the theory of curved surfaces, discussed by Wil- 
ezynski in the paper referred to. 

|| Note on Seminvariants of Systems of Partial Differential Equations, American Jour - 
nal of Mathematics, vol. 41 (1919), pp. 123-132. 
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transformations (in cases which involve two dependent variables) are not 
always the ones of greatest interest geometrically.* 

Ordinary differential equations will not be considered, for the reason that 
the methods of this paper, insofar as they apply to ordinary differential equa- 
tions, reduce to old methods in such cases. 

Particular attention must be called to a change in terminology from the 
author's paper cited. What were there called pseudo-canonical forms are now 
termed pseudo-semi-canonical forms, the first name being reserved for certain 


new unique forms. 

The author wishes to thank Professor Wilezynski for valuable criticisms, 
which have resulted in an improvement in the form of this paper. He also 
desires to make the following further acknowledgments: Dr. A. L. Miller, in 
setting up a completely integrable system of equations to be used as a basis 


for a special theory, accidentally discovered that seminvariant coefficients 
resulted in that special case when a certain unallowable transformation was 
employed. Acting upon this hint, the author succeeded in establishing the 
results contained in his previous paper. While considering the feasibility of 
the extensions embraced in the present paper, special results of Dr. W. W. 
Denton in the theory of developable surfaces were of assistance. 


2. PSEUDO-SEMI-CANONICAL FORMS AND SEMINVARIANTS 


Let us assume that the completely integrable system of differential equations 
(a) has one dependent variable, /, and n independent variables, i, «++, Un; 
that it consists of p equations, each of which expresses a certain derivative of 
y linearly and homogeneously in terms of certain q primary derivatives (in- 
cluding y itself); that no primary derivative is of higher order than any of the 
left members of the equations (a); that if a given y-derivative, 


occurs in any equation of (a), then all derivatives of lower order, from which 
the given derivative may be obtained by differentiation, are also present in 
that equation. f 

We shall group the various y-derivatives and coefficients of (a) as follows: 


* See, for example, Wilezynski’s basis for the study of congruences of straight lines, in his 
prize emoir, “ Sur la Théorie Générale des Congruences,” I. e. Compare with that used by 
Green, in section 8 of his paper, “ Projective Differential Geometry of One-parameter Families 
of Space Curves etc.,“ I. e. The former employs the transformations used in the present 
paper, while the latter does not. 

+ The purpose of this assumption is to insure that a transformation of the dependent, 
variable shall replace (a) by a system of exactly the same form. Cf. Green, Linear De- 
pendence of Functions of Several Variables, ete., I. e., section 6. 


ohr 
— 
+++ 
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A y-derivative of the same order as the left member of its equation is said to 
be of the zeroth class. A y-derivative of order i less than the left member of 
its equation is of the ith class. A coefficient of an ith class y-derivative is of 
the 7th class. 

The equation 
(la) =A(M, 7 
yields by differentiation 

= ＋ NA 7 


71 


O, UM, po O, lz, „ = O, „ where 


(1b) 


l; 751 
Y. = (0 (J. — pi)! pit? 

When equations (1) are substituted in the system (a) there results a new 
system, (@), of the same form as (a), in which the independent variables are, 
as before, While the dependent variables are 7, (% = n), 
etc. These new variables may be expressed explicitly in terms of /, un, ete., 
as follows: 

(2a) p=l/r, 


Mu; 
Yu; u 2 ＋ „ 


. 


1 2655 Pn 1 7 


with pi, po, „ Pn as in (10). 
A coefficient of (@) of the first class is of the type 


* 
2 Au. 
= 6 ＋ DL pia—, 
ial 
where the p; are integers (including zero) and the a; are seminvariant zeroth 
class coefficients of (a).* 
Let us taket n equations of the type 8 = 0, choosing the coefficients 8 in 
such a way as to permit the solution of this set of n equations for y,,/u (i = 1, 
Cf. A. L. Nelson, I. c., section 2. It was there tacitly assumed that a first-class coefficient 
would be changed (if at all), by the transformation (la), by the addition of a multiple of a 
single such u-fraction. The structure of higher class coefficients of (d) is also discussed in this 
section. 
+ The possibility of this will be discussed in section 8. 
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„%), regarding these fractions as independent unknowns. We denote 


these solutions by (Mu;/M). 
It is of course impossible to find a function, u (,, +++, %), which will 


satisfy the n differential equations 
8. = 0, (c= 1. „n), 


unless certain integrability conditions are satisfied. These conditions are 


0 Mu, 0 u 
0 (2) 52 
Ou; Ou;\ 


Nevertheless, without assuming that equations (4) hold, we substitute the 
solutions @) for in all the coefficients of (i). 

However, since and (7,7 = 1, +++, #7), are two dis- 
tinct functions of the coefficients of (a), there are two independent ways of 


substituting for u u, namely, 
and 


This fact would give rise to a lack of uniqueness, which must be avoided by 
observing the following rule: For any particular coefficient of (@), we must 
decide which of the two possible substitutions for py, u,/p is to be used. Through- 
out the coefficient, that substitution must be used, the symbolic identities (4) being 
employed, if necessary. For example, suppose that a certain coefficient of (@) 
is of the form 

Muy ug ue 


4 


If u = d, uh, = b, then = +’ b, or = bi, +a’ 
Either substitution for u., ντ’ ανq may be chosen, but when chosen, must be used 
for Muy uy us Ms Mu; aNd all other u-fractions in the coefficient whose 
numerators are partial derivatives of That is, supposing that py, 
is chosen equal to a), + a’ , bi, must be replaced by a), whenever hi or one 
of its derivatives appears in the coefficient. 

Having observed this precaution, it is evident that we have formally ob- 
tained u unique form of the completely integrable system (a). We shall call 
this form a pseudo-semi-canonical form (A), of (a), and shall use capital 
letters for its coefficients. It is completely characterized by the n equations 

-,n). The seminvariance of the coefficients of (A4) may 


be established by the following argument. 
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Let us refer to the set of coefficients of the system (a) as fa]. Under the 
transformation (2a) they are replaced by the set [a,], the first class coef- 
ficients of which are of the type (3). [a] and la] may be regarded as the 
coefficients of any two systems of type (a) which are equivalent under the 
transformation (2a). Apply to (a) and (a,) the transformation / = vj, 
which is of the type (24). The coefficients ,] and [a,,] which result have 
the following relations: (i) [a,,] are the same functions of [a, | and %, 
(i =1,-++,n),as[a,] are of [a] and (ii) [a,,] are the same functions 
of [a] and (6 = 1, %), as d,] are of [a] and 

In d, make n coefficients of the first class (see (3)) vanish by a suitable 
choice of (i = 1, +++, „), as functions of [a], regarding these fune— 
tions (denoted by (%)), as independent unknowns. If we substitute 
(%%) for throughout [a,], and observe the above mentioned precau- 
tion concerning cross-derivatives, we obtain unique functions | .1] of the ori- 
ginal coefficients [a]. The functions ] are also unique as a set. 

Make the corresponding first class coefficients of |a,,| vanish. This will be 
accomplished by taking %% equal to the same functions, (%%), of a, as 
the ( are of [a]. But this choice of the v,,/v is equivalent to taking 
(uv ),,/mv equal to the same functions, ((uv),,/uv), of [a], so that the fune- 
tions ((uv),,/uv) and / are identical. The substitution of (»,,/v)’ for 
„%% throughout [a,,] will yield a set of unique functions, [.1]’, of the coef- 
ficients [a]. 

By virtue of the relation (i), the functions | .4]’ are the same functions of 
fa, | and as [A] are of [a] and /. But (»,,/v)’ are the same 
functions of the coefficients [ a, ] as (%% are of the original coefficients [a]. 
Hence the functions ]“ are the same functions of [a, |] as [1] are of [a], 
where e and [a] are the coefficients of any two systems of the type (4) 
equivalent under the transformation (20). 

In view of the relation (ii), the functions A] are the same functions of [a] 
and ((ur)u,/uv) as [A] are of [a] and Since ((ur),,/uv) and 
(%%) are identical, we see that the functions [.1]’ and [4A] are identical. 
Ilence they are seminvariants. 

We note that the number of seminvariant coefficients of a pseudo-semi- 
canonical form is pg — n, exactly the number of seminvariants in a funda- 


mental set as obtained by Green.* They are moreover independent, no two 
arising from the same coefficient of (a). Hence the coefficients | A] are a funda- 
mental set of seminvariants. 


3. SEMI-COVARIANTS 


The transformation (la) leads us to two important sets of equations. One 
set is that by means of which the new coefficients (those of (@)) are expressed 


* Lincar Dependence of Functions, etc., I. e., section 6. 
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in terms of the old ones (those of (a)) and of the transformation function u. 
The other is the set of equations (2), which express the new variables in terms 
of the old ones and of u. There is a parity existing between these two sets 


which, apparently, has not usually been recognized. 

To exhibit one aspect of this parity, let us make in equations (2) the sub- 
stitutions pu,/u = (Mu,/m), (i = „n), which are determined by the 
equations 8; =0, (i=1,-+-,n). In these substitutions, we disregard 


the factor uw. The functions 
(5) Y= 5 V. = Yu; + (Mus 4) 
are relative covariants. We shall prove this proposition by an argument parallel 
to that made in section 2 for the seminvariance of the functions [A]. 

When the transformation (2a) is made, the set of variables 

[y]: y. Yu, ete. ween), 

is replaced by the set [ y, ], given by equations (2). When we apply the trans- 
formation 7 = vy to the systems (a) and (a, ), we get the new sets of variables 
[y,] and [y,,], respectively. If we substitute v,,/y = (. in[y,], there 
results (if we disregard the factor ) the unique set of functions| FT]. Simi- 
larly, the substitution of %% = (vu,/v)’ in[y,,] gives rise to the set of func- 
tions| Y]’. As in section 2, we can prove the following statements, which 
suffice to establish the truth of the proposition: (i) The set T] are the same 
functions (neglecting a factor) of [y,] and [a,] as [T] are of [y] and [a]; 
(ii) Except for a factor, the set []“ are the same functions of [/] and 
((uv)u,/uv) as [T] are of [y] and (/. Hence the functions [T]“ and 
T] are identical, except for a factor. 

An induction proof may be made as follows: We first show that 


(2 (2 — 
4 


In order to do this, let us refer to the definitions of (y.,/u). 
functions arise from n equations of the type (3), we have 


(6) B. = Bg + 1 Poi &gi (2 = 0 
121 


But, since the combinations B, are seminvariants, 


- — 
5. 0 (9 „„ 


the pr a,; being also seminvariants. By use of (3) and (6), we may put these 
equations in the form 


Mu, U. 
1 gt — — = 0 
() 


* 

— 
̃ 
— 
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The determinant of this system does not vanish, since we have assumed that 
the equations (6) are solvable for the (u.,/u). Therefore, 


= (2 — 
4 


As a result of this lemma, we have 


Mu Mu , 
Yu, * 2 


(i 1, 


It is also evident from (2) that Y = n»Y. Now assume that 
Tu, ol, = u Fu, 


From the law of formation of the functions (5), it follows that 


Mi, «++, 
4 Yiu-n, %, 


Mn, Pn 


0 U 


= 


where all summations are extended over r, p2, „ b; bi = O, „ ; 
D. = O, „ %. Hence, by assumption (7), 


11. %, ly, Il, lias 


) 
Tu. 
Pn pi. , l-. 
(tt), | 
— 
0 — a (Hs 
6 Pn 1 
4 > h 
Ku; 7 
1. + (2) 
— 


ALFRED L. NELSON 


which completes the induction proof. 
The form of (5) shows that all the functions there listed would be inde- 


pendent except for the system (a). By means of (1) all y-derivatives are 


expressed linearly in terms of the primary derivatives. But (5) may be solved 
for /, Mr ete., in terms of I, Vi, ete. Hence all semi-covariants (5) are 
functions of those which correspond to the primary derivatives, and of sem- 
invariants. 
A uniquely determined set of functions may also be obtained from the 
coefficients of (@) by the substitutions 
= — (i 1, „ n), 
which are suggested by equations (2). This set of functions will be charac- 
terized by the relations 
Y., = 0 (i „5 „„. 


As a result of the uniqueness of this set, they are semi-covariants. 


J. APPLICATION TO ALGEBRAIC SEMI-COVARIANTS 
We may regard a pseudo-semi-canonical form of a completely integrable 
system of linear homogeneous partial differential equations as a formal analogue 
of the reduced binary form, and of the semi-canonical form of the linear homo- 
geneous ordinary differential equation. Indeed, the suggestion contained in 
the last paragraph of section 3 may be carried out in the theory of binary 
forms. 
When the binary p-ic* 
f = ig + pay yut+ 
is subjected to the transformation 
x=&+n7, Y=; 
he variables. . are changed in accordance with the equations 
the variables, 2”, 2 5 „J, are changed in accordance with the equations 
+ nn . y on (E + nn 1, y? n, 
while the new coefficients are expressed in terms of the old by means of the 
equations 
(9) a. = ao, a; = a; + nao, d%½ = d + 2nay + u a9, ete. 


* Cf. Dickson, Algebraic Invariants, 1914, p. 47. 
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The substitution in (9) of n = — En, suggested by (S), will yield a set of 


independent semi-covariants. 


5. EFFECT OF THE TRANSFORMATION OF THE INDEPENDENT 
VARIABLES UPON THE COEFFICIENTS OF (a) 


Let the independent variables be transformed by means of the equations 
(10) i; = (u,.) (7 = n). 


The effect upon the variables y, /, ete., is shown by the following equations: 


y’ a 722 A 
y=49, Yu; = Vi Yuzu; = Ui Ui Gus 


= U. U, 9. (i +7), 


Us + 3 U. U. Guu + Ui 


UD. U, + Guus (i j), 
(4A), ete, 


ye 1 


where % = 09/0%;, etc., and the % are integers (including zero). An easy 
induction suffices to show the correctness of the last expression of (11). 

It is obvious from (11) that when the expressions there given for /, J, ete., 
are substituted in (a), and the resulting equations collected in a system (d) 
of the same type, a given y-derivative may give increments only to those 
terms of its equation whose y-derivatives yield the given y-derivative by 
differentiation. For example, the new coefficient of i, in any equation of 
(%) will be due to the old %, Hutu, ete., provided these derivatives are 
present in this equation. In this respect, the effects of the transformations 
(la) and (10) are similar. There is this difference to be noticed, however. 
The right member of any equation of (2) will contain all y-derivatives which 
vield by differentiation the derivative in the left member of this equation. 
Of equations (11), on the other hand, there is only one, namely the first, of 
which this is true. For example, in the expression for y,,.,, (7 + j), there 
occurs none of the variables /, V, J, all of which would appear in the 
corresponding equation of (2). 


u, = 
(11) %%% = 
rely 
oof. 
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In order to show more explicitly the effect of the transformation (10) upon 
(a), let one of the equations of (a) be the following:* 


(12) 


n 
—— Ym, ++ My + CY my, m. + 
g= 


where LI, > mt; and where the g-term is typical of all first class terms, while 
the ¢-term typifies all terms of higher class whose coefficients will, under the 
transformation (10), receive additional terms only from the y-derivatives 


Ym, m, mii. My 1 n), 


of order one higher than % ..., m,. 
The last equation of (11) gives us, as the expression for the y-derivatives 
from the e-terms: 


Ym, „ mtl. Moris 


A 
(13) Ym, M4, m,+1 


gr 


where we are assuming 9, which cannot be negative, to be different from zero. 

Suppose that one of the e,-terms receives an increment, by the trans- 
formation (10), from a y-derivative of higher order than that in the e,-term, 
say from y;,,..-,1,. The expression for y;,,...,1, will be found by successive 
differentiation of (13). But since the second term of the right member of (13) 
is a product, repeated differentiation will not make the ym,,...,m, term disap- 
pear unless at least one of the U; and its derivatives are missing from the factor 

94 0, 

Equations (11) show that this would mean that U; would also be absent from 
the coefficient of every derivative in the complete expression (13). There- 
fore, in forming out of (13) the expression for / , , not Only Ym,,.++,m,> 
but ½ +++, As Well, disappears. Hence, if an e¢,-term 
receives an increment from a higher order derivative in its equation, the term 
will likewise receive an increment from the same higher order derivative. 
But this is contrary to the assumption that the ¢-term receives increments 
only from the derivatives in the terms. Therefore the é are unaltered, 
except for a factor, by the transformation (10). 

When the equations (11) are substituted in (12), we obtain the equation 


"Uy 


It is not material to the argument which one of the derivatives of order 1 + Zl; is the 
left member. 


2 n 


n 
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n 
7 


(Vun. . · . 1 . U, U, 


7] 

A yrly 


n 
+ * Eg My 1 L 9—1 9 L 


g=l 


＋ Ymy, U1 * g * U. 


n 


ye 1 yr e 


Upon dividing by Li“ U:“ U., and transposing, we see that 


°°* Ug-1 gti 


1 
1 n g=1 


where 
5 = U /U g° 
All zeroth class coefficients, therefore, are unaltered, except for a factor, 
by the transformation (10). Since they are known to be unchanged by the 
transformation (la), they are relative invariants. 


If we write the 8 expression as 
3 1 
(15) ( (m1 = - 1), 
g=1 


we may take this as typical (except for the factor), not only of all first class 
coefficients of (d), but also of all coefficients, of whatever class, which are 
altered (factors disregarded) only by linear combinations of . (i = 1, ---,n). 
In all such cases, the coefficients in these linear combinations are unchanged, 
except for a factor, by the transformation (10). 

Consider any pseudo-semi-canonical form (A) of (a), characterized by 
the relations B. = 0 (g = I, „n). We wish to observe the effect of (10) 
upon its coefficients. This may be done by direct substitution, but it is 
desirable for our purpose to indicate a different method. 

The coefficients, B,, of (q), whose vanishing characterizes the pseudo- 
semi-canonical form, may or may not be unaltered by the transformation (10). 
In the former case, the pseudo-semi-canonical form will be preserved, and 
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equations (14), which describe the effect of (10) upon the original coefficients 
of (a), will serve also to indicate its effect upon the coefficients of the pseudo- 
semi-canonical form (.4). We need only read A for a, B for B, ete. 

In the latter case, the pseudo-semi-canonical form is violated, and must be 
restored by a second application of the transformation (la). Equation (15) 
shows that the undesired increments, which must be removed from the B,, 
are at worst linear combinations of 7;, ---, 7, with relative invariant multi- 
pliers. In this second application of (1a) to the form ( A), unn, „ u, 
must be replaced by linear combinations of the kind just described. 

We recall that the coefficients of (@) are equal to the corresponding coef- 
ficients of (a) plus linear combinations of u-fractions with multipliers taken 
from among the original coefficients of (a). Hence, since we are applying 
(la) to a form of (a) whose coefficients are seminvariants, we see that the 
coefficients of (.1) must, certain factors neglected, equal the corresponding 
coefficients of () plus functions of seminvariants and of the m1, ---, 7, and 
their derivatives. 

The foregoing paragraphs throw some light on a question which arose in 
the author’s previous paper.* Certain pseudo-semi-canonical forms in the 
special cases there discussed, not only yielded simpler seminvariants than did 
the classical process, but also produced a relatively large number of semin- 
variants which were relative invariants as well. 

Equations (11) show that the coefficients of the primary derivative y will be 
unaltered by the transformation (10), except for a factor. For this reason, the 
system (d) is usually much simpler than (1). However, when a pseudo- 
semi-canonical form (.1) has been used which is not violated by the trans- 
formation (10), the systems I) and (a) are identical, and we may be sure 
that at least the coefficients of / in (I) will be relative invariants. 


6. PSEUDO-CANONICAL FORMS AND INVARIANTS 

Equation (15), factors neglected, is ‘typica! of all coefficients of (Ii) which 
have as increments only linear combinations of 1, +--+, Let us choose f n 
such coefficients and form the equations B, =(0,(g=1,---,n). Assuming 
that the B, have been so chosen as to permit, we solve the equations for 
m1, °***, Mm, regarding these as independent unknowns. When these solutions, 
denoted by (m), «++, (mm), are substituted for the 's throughout (4), we 
obtaip (formally) a form (A) of (.1), whose coefficients are unique except 
for certain factors C --- UL". We shall call this form, which is charac- 
terized by the relations B. = 0 (g = I, „n), a pseudo-canonical form 
of (a), and denote its coefficients by German capitals. By an argument 


* A. L. Nelson, I. c. Cf. section 6. 
+ Cf. the discussion in section 8. 
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analogous to that used in section 2, we are able to prove that these coefficients 
are unchanged, factors neglected, by the transformation (10). Since, in 
addition, the coefficients of (A) are functions of seminvariants and of 7, 

„ yu, and the (m), , (%) are seminvariants, it follows that the coef- 
ficients of (A) are relative invariants. 

The invariants thus formed are in number 2n less than the total number of 
original coefficients of (a). That is to say, our set is n short of the number 
required for a fundamental set. Those we have are evidently independent, 
and it remains to indicate how n more may be formed, independent of each 
other and of those already obtained. These supplementary invariants may 
be found by a device exactly analogous to that employed by Wilezynski.* 

In section 2 it was proved that 


(5 (“) __ 
4 1 


In an exactly similar manner, it may be shown that, except for a factor, 


(16) (It) = (ut) — 1. (1 1 ). 


The second equation of (14) shows that a, (7, ) must be transformed by (10) 
in accordance with the equations 


1 
= Las (%) + (g = 1, „n). 


Substitution of the first equation of (14) gives 


1 
(17) (%) 


for all (%) which arise from first class coefficients. Similarly, the third and 
fourth equations of (14) prove that for all other (,), the same factor, 1/ U, 
occurs. Hence, combining (16) and (17), we see that the complete expression 
for (It) is 


1 
(ni) = - 14 (121% ). 
1 


Choose any other relative invariant, 0, where 


By differentiation, we obtain 


9. = Gy, . U, 6] 


* One-parameter Families, ete., I. e. Cf. equations (25). 
Trans. Am. Math. Soc. 13 


771 
90 = Ui“ U. 0. 
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1 yell 7711 go le rrve 
= U.. ＋ Ui“ ... Ui U. Uist U. 8) 
05 
so that 


(i 


The functions 
(i 1. „ n), 

will be n new relative invariants, evidently independent of each other. They are 
also independent of the invariants which are coefficients of the pseudo-canonical 
form, because none of these coefficients possessed any of the (7;) as leaders. 

We recall here general principles underlying the formation of the coef- 
ficients of the pseudo-canonical form (A). Neglecting factors, the zeroth 
class coefficients of (A) are equal to the corresponding coefficients of the 
pseudo-semi-canonical form (4) from which (A) was formed. The non- 
vanishing coefficients of the first class are linear combinations of first class 
seminvariant coefficients of (AH), the coefficients in these linear combinations 
being numerical multiples of zeroth class invariants. Each of the invariants 
of any particular class higher than the first, is equal to the corresponding 
seminvariant coefficient of (4) plus combinations of lower class seminvariant 
coefficients. 

Hence, in the set of invariants, including the n invariants 


(18) J. (u. + (i 1. n), 


we may solve for the corresponding semin variant leaders. As a result, any 
invariant whatever of the original completely integrable system (a), being 
known to be a function of the fundamental seminvariants and their derivatives, 
is seen to be a function of the invariant coefficients of (A) and of the invari- 
ants (18). Therefore these invariants are a fundamental set. 


7. COVARIANTS 


The semi-covariants (5) may be regarded as the variables of the pseudo- 
semi-canonical form (.4). Under the transformation (10), they will be altered 
in accordance with equations (11), reading Y, Fu, etc., for /, yu;, ete. We 
may solve these equations, so as to express 7 ; 8 , ete., in terms of T, Y,,, ete. 

Hqwever, if the pseudo-semi-canonical form is violated by the transforma- 
tion (10), it must be restored by a transformation of the type (la). In other 
words, equations (2), with ue u, „ un so chosen as to put the system (A) 
in the pseudo-semi-canonical form again, must in such cases follow the trans- 
formation (10), in order to give in final form the effect of (10) upon the semi- 
covariants (5). This will give us expressions for Y, a , ete., as functions of 


194 
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the semi-covariants (5), the seminvariant coefficients of (AH), and of U;, --- 

The substitutions 7; = (u.); (i = 1, ---,”), which produce the pseudo- 
canonical form (A) of (1), will also give a unique set of equations which 
express F, Wea: ete., in terms of T, Y,,, ete. Neglecting the factors Li“, 

* 3 , let us call these expressions Y, Nu, ete. They are relative covariants, 
and are characterized as a set by the relations B. = 0 (% = I, n), 
which are characteristic of the pseudo-canonical form (A). 

Those of the Y, Nu, ete., up to a certain order, which is determined by the 
left members of the completely integrable system (a), are evidently inde- 
pendent. All covariants are functions of these independent ones and of invariants. 

It is sufficient merely to remark that fundamental sets of simultaneous 
invariants and seminvariants of sets of completely integrable systems of 
partial differential equations may be constructed by the method outlined in 
the preceding sections. Examples of such sets, of which the simultaneous 
invariants might be of great interest, are suggested by two theorems of Koe- 
nigs* concerning the perspective plane nets of the asymptotic lines and of 
conjugate systems of curves on curved surfaces. Further examples are 
furnished by the Laplace suite, as applied to plane nets, f to conjugate nets of 
curves on surfaces, f and to congruences of straight lines.§ 


S. CONCERNING THE POSSIBILITY OF SECURING A PSEUDO-CANONICAL FORM 


‘ 


It must be emphasized that the “*reduction”’ of a completely integrable 
system (a) to a pseudo-semi-canonical form is possible only when there are 
present n first class coefficients of (@) of the type (3). Moreover, even after 
such a pseudo-semi-canonical form (4) has been obtained, the further“ re- 
duction ” of (4) to a pseudo-canonical form can be accomplished only when 
we have n additional coefficients of (1), of the first and higher classes, of the 
type (15), not corresponding to the n coefficients of (@) previously used. 

It would be very desirable, therefore, to show that the situation just de- 
scribed always arises. Unfortunately, this very general conclusion cannot be 
established. On the contrary, an important example is at hand to prove the 
‘impossibility of such a conclusion. We are, however, able to show that for a 
large class of completely integrable systems, the method with which this paper 
has concerned itself, may be used. 

*Paris Comptes Rendus, vol. 114 (1892), pp. 55-57; p. 728. 

F E. J. Wilezynski, One-parameter Families, etc., I. e., section 4. See also A. L. Nelson, 
Quasi-periodicity of Asymptotic Plane Nets, Bulletin of the American Mathe- 
matical Society, vol. 22 (1916), pp. 445-455. 

G. M. Green, Projective Differential Geometry of One-parameter Families of Space Curves, 
etc., I. e., section 6. 

E. J. Wilezynski, Sur la Théorie Générale des Congruences, I. c. 
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Suppose that the highest order derivative occurring in the completely 
integrable system is of order %, and that all kth order derivatives of / are 
present. For the purpose of our discussion, it is immaterial whether all the 
kth order derivatives are the left members of the equations of the system, or 
some are primary derivatives. There are in all“ 


(n ＋ 1) E —1 
nt 


of these partial derivatives of order /. Among the cross- derivatives of this 


order will be a certain number of the type 
( 

(19) 

which have been obtained by one and but one differentiation with respect to 

certain of the independent variables, u,, u,, ue, ete. Each such derivative 

must yield an increment to the coefficient of every derivative of lower order 

out of which (19) comes by differentiation. In particular, the coefficients of 


the first class derivatives 


cee, Oe , Gly. cee, 1. 0, leg e, bys 
(20) 


in the system (@), will receive, respectively, the increments 
(21) un, / u, un, u, ete. 


Let us try to count the derivatives of the type (19), listing them under n 
heads, according as , U2, +++, Or ½, is the independent variable which is 
represented just once in (19). Except for duplications there would be n - H=! 
of these derivatives. However, when a certain derivative would be listed 
under a number of these heads, that single derivative will yield suitable incre- 
ments to the same number of first class coefficients. Hence there are exactly 
„II; increments of type (21) to coefficients of first class derivatives (20) 
from /-th order derivatives (19). 

For n = 2, this number is exactly 2, for all values of . For k = 2, it is 
equal to nn — 1). Moreover, Higi increases with either n ork. Therefore, 
for u >1,k > 1, there will be at least n coefficients of the first class of (dq), 
of the required type, which will serve to determine a pseudo-semi-canonical 
ford. Equations (1) show that such a pseudo-semi-canonical form will be 
undisturbed by the transformation (10). 

It must be remembered that, in general, the n - H coefficients we have 
considered are by no means the only possible ones for our purpose. Deriva- 
tives of type (19) may yield suitable increments to other coefficients than those 


+ Chrystal, Algebra, Part II, sec. 10, p. 10. 
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of type (20). Moreover, kth order derivatives of other types than (19) may 


give rise to proper increments. 
Having obtained a pseudo-semi-canonical form (4), determined as outlined 
above, our next concern is with the system (4). But, as has been remarked 


in section 5, it will suffice to discuss (4). Consider the n “ straight deriva- 
tives 


Yuk (J n), 


of order , in the system (a). Each of these will give an increment, — % 7;, 
to the coefficient of %% in its equation. None of these straight derivatives 
have been used in the determination of the pseudo-semi-canonical form. 
Hence they may be made use of to secure a pseudo-canonical form. 

In addition to the coefficients of (A) suggested, we have available, in gen- 
eral, certain suitably incremented coefficients of (A) which come from cross- 
derivatives. Indeed, the cross-derivatives (19) vield increments of the proper 
type to certain first and higher class coefficients of (A) which have not been 
used in the determination of the pseudo-semi-canonical form. 

The above discussion shows that one or more pseudo-canonical forms may 
surely be obtained in completely integrable systems of the rather regular type 
assumed, provided all the kth order derivatives are present. Indeed, especially 
for larger values of n or , we are assured a large freedom in the choice of the 
2n coefficients whose vanishing characterizes a pseudo-canonical form. This 
indicates that pseudo-canonical forms may be obtained in a great number of 
less regular completely integrable systems. 

In cases where some of the kth order derivatives are primary derivatives, 
however, one or more of them may be removed by a preliminary transformation 
of the type 

= bi (1, Un) n), 
which refers the configuration to a particular set of parametric curves. If a 
sufficient number of these primary derivatives are removed from all of the 
equations of the completely integrable system, the number of suitably incre- 
mented coefficients of (@) or (4) may become less than n, so that the pseudo- 
semi-canonical form, or the pseudo-canonical form, or both, cannot be obtained. 

An example of such systems is the one used by Wilezynski* in the study of 
curved surfaces referred to their asymptotic lines. It is only possible, in 
this case, to obtain a pseudo-semi-canonical form, which coincides with his 
canonical form. 

Tue UNiversity OF MICHIGAN 
E. J. Wilezynski, Projective Differential Geometry of Curved Surfaces, I. e. Cf. equations 
27), (37), (88) and (46). 


ARITHMETICAL PARAPHRASES* (II) 
E. T. BELL 


IV. THE FUNDAMENTAL SERIES FOR PARAPHRASE 


1. A former paper under the same title is continued, We first collect a 
few Fourier developments in a specially prepared form immediately suitable 
for paraphrase, and then apply the theory of the preceding paper to read off 
from them a few of their innumerable arithmetical consequences. The classi- 
cal expansions are not in the appropriate form; we require the arithmetical 
developments in which the coefficients of the powers of ¢ are given as explicit 
functions of the divisors of the exponents. These developments are unique. 
The chief elliptic theta series for paraphrase fall naturally into two sets accord- 
ing as they do or do not explicitly involve class numbers in their coefficients. 
This paper is concerned only with the latter kind and their more immediate 
paraphrases; but in all work like the present with the arithmetic of elliptic 
functions, these series of the first set appear to be fundamental, presenting 
themselves repeatedly in the most diverse investigations. Hence we shall 
give a fairly representative selection from them. 

In writing down the few paraphrases of this paper we have aimed merely 
to show how such lists of properly prepared formulas may be used, much as a 
table of logarithms in other computations, for the almost immediate discovery 
of paraphrases broadly of the Liouville kind. We have purposely omitted 
all applications to specialized functions and their related theorems, the method 
of deriving special results being sufficiently evident from the papers of Liouville 
and Pepin, and from Bachmann’s book. In connection with the series only 
brief notes on the calculations, in all cases simple, have been retained; but 
there are sufficient indications of the course followed for all the expansions 
to be quickly rechecked if desired. 

2. The , n, 2*, d, 6, Ti, T2, Ta notation, explained in §7 of Part I, is 
usedythroughout; and in the elliptic or theta series the summations refer to 
Read before the San Francisco Section of the American Mathematical Society, October, 
1918. To save space, the extensive lists of theta and other expansions contained in the paper 
is read, have been omitted. Some of these, for the doubly periodic functions of the third 
kind, will appear elsewhere. Only the series necessary for illustrating the method of para- 
phrase in V have been retained. 

F These Transactions, vol. 22 (1921), pp. 1-30. 
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all values > 0 of the type 71, T2 or Ta indicated, the coefficients of the several 
powers of being written after those powers in (), { } or [ ] as convenient. 
The d, notation is that of Jacobi (Werke, vol. I, p. 501), with do in place of 
his 9. After glancing at the notation in §§3, 4 here and Part I $7, the 


reader may pass at once to § 12, using §$ 3-11 only for reference. An im- 
portant desideratum in regard to the series is pointed out in § 7, footnote. 

3. Numerical functions. The constants occurring throughout this kind of 
work, other than class numbers, depend chiefly upon the functions now 
defined. Let , (n), €)(n), Cr (n) denote respectively the sum of the rth 
powers of all, of the odd, of the even divisors of n; and let En (n) denote the 
excess of the sum of the rth powers of all divisors = 1 mod 4 of n over the 
sum of the rth powers of all those = — 1 mod 4; also let E, (n) denote the 
excess of the sum of the rth powers of all divisors of n whose conjugates are 
= 1 mod 4 over the sum of the rth powers of all those whose conjugates are 
= — 1 mod 4; and define E, (n) by the identity E (n) + E (n) = E, (n). 
Write 


do (n), do (n), fo (n), So (n), (n), (mn) (n), s“ (n), 


denoting the respective numbers of divisors pertaining to the six classes defined 
by the functions. For convenience we introduce six further functions 


ar (n) = (n); N (n) 
ur (n) = + (n); 
8, (n) = 4K, (n) E, (n); = FF (n) - (- 1) (n); 
or (n) = (n) — § (n); 


and as before write ag(n), ---, po(n) =a(n), , p(n). OF these, 
a,(n) is seen to be the sum of the rth powers of all those divisors of n whose 
conjugates are odd. The equations for the rest express frequently occurring 
functions of the divisors which it is unnecessary at this point to define verbally. 
All twelve functions ¢,, , o, will be recognized as those which first present 
themselves in the principal theorems concerning representations of n as a 
sum of 2, 4, 6 or S squares; and in the simplest applications of the paraphrases, 
such as those arising from f(a|) = 1, 42, a4, ---,g(|x2) = 2, 2°, 2°, for 
all values of x, they reappear in many investigations, including those for 
3,5,7,9, 11 or 13 squares.“ In reductions of formulas the following imme- 
diate consequences of their definitions are most frequently useful. 


* The treatment of these odd numbers of squares is giveninthe American Journal 
of Mathematics, vol. 42 (1920), pp. 168-188. 


= 
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m = — 1 mod 4, E(m) = 0; 
(1) 


m= +1 mod 4, (n) = (— 1)? (m); 


E. (m) = 0 or 28, Cn) according as m = + 1 or — 1 mod 4. 
For n = 2*m, m = db, a = 0; (2)-(7) 


+ = & = - 1] (m)/(2" 1); 
= &(m); 

(27-1) = 2°(2" 1) (n); 
= 2 (n) = (nm) + (n) =)), 
= gun); = 261 (n) = (2n) — (20); 
+ = (2n) — (2n) = — pi(2n); 
d = (- 10 (m); 

= E(n) = E(m); 


and for n = dé, 
(8S) 1)? n) —X(n). 


To emphasize once more the notation, which will be followed in all subse- 
quent lists, the = in (4), (7), (8) refers to all divisors d, 6 of the indicated types, 
here 72 for (4), (7), and 73 for (S). 

4. Theta series and constants; 3,(x) = d (, 4), 3. = . (0). 


(9) =14+2>5(— 1)" cos 2nz, 1+2>0(-1)"q"; 
51 () = 2L ( — 1) sin mz, 92 V3 
2L mg; 
(11) d(x) cos me, 2 = 2) 
(12) = 1+ 2309" cos 2nz, 1 2 
With but a few exceptions which can be derived from the others by means 
of the transformation of the second order, all of the series for d 3% in which 


(a, 6) = (2,3), (0,3), (0, 2) and (a,b) = (1,1), (2,2), (2, 4), 
(4,2), (3,3), (4,4), (0,2), (0,4), (0,6), (0,8), can be simply 


(10) 


found, from the series for the , k’, K constants and their powers as given by 
Jacobi in 58 40-42 of the Fundamenta Nova. For the rapid and systematic 
use of the method of paraphrase the series for all of these constants will be 
found indispensable. The coefficients of all are expressible directly in terms 
of the numerical functions defined in § 3. Here however we need give only 


the following selection: 
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(2) 

(3) 

(4) 

(5) 

(6) 

(7) 
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Jo 2 90 Js =1+ 4 (- 1)" (n); 
Jo 02 = 1) % (m); 
9% (- 1) E (n); 62 = E(m); 
=1+4> qréi(n); 
and the following essential constants in reduced form from Jacobi, J. c., § 41: 
un); B= 4 1) (n); 

C=1+ (un); 

B＋ C= I- = 1) (n); 4-3 = 16 LV m(n); 


of which C is 4K E/, and all are required in the derivations of the series 


(13) 


(14) 


(15) 


(16) 


in § 6. 

5. Eighteen doubly periodic theta quotients.* We shall give explicitly only 
those of the eighteen which can not be derived from others by changes of 9 
into - / and æ into x + 7/2, or by the transformation of the second order. 


(17) (r) = do (/ (x) = 4309"? (> sin dx), [71]; 
(x) = (0 / (K) 
= ese x + (- 1)" cos dr], IL Taz]; 
go (x) = Io 03 (/d) 
1)% cos de], [ Ta]; 
1 (1) = 8 62 (70/01 (4) 
cot ＋ 4 (- 1)“ sin 2d], [ 73]; 


(18) 


(19) 


(20) 


in which, as always henceforth, the T.] indicates the type of division for the 
m or n in the exponent. Note particularly that when the exponent is en, e 


being a numerical constant, the type refers to the divisors of n, and not of en, 
and so in all similar cases. 

It will be sufficient here to indicate how the remaining fourteen functions 
in Hermite’s list (which includes all of the doubly periodic functions considered 
by Jacobi, and six others), may be derived from these. In (21) the first 

* Calculated (with corrections) from the equivalent set given by Hermite, Sur les Théorémes 
de M. Kronecker, etc., Oeuvres, vol. 1, p. 243, or Journal de Mathématiques 
pures et appliquées, (2), vol. 9 (1864), p. 145. The suffixes correspond to the order 
of the functions in Hermite’s list. For the connection of these with the doubly periodic 
functions of the second kind (5 10), cf. Messenger of Mathematics, vol. 49 
(1919-20), p. 81. 
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member in each triad of functions is transformed into the second by the 
substitution x ~ a + 7/2, and into the third by gq ~ - q;i =V—1. Thus, 
carrying out the indicated transformations we find from (17) and the first 
triad the developments of 


3 (r) = de () 
upon changing x into x + 7/2 in (17), and 
Gs (r) = Bo do (4 /oͤz (&) 
on replacing q by — q in (17) and reducing. Similarly for all fourteen. 
(di, , ts), (S, — di, ihr), (Gr, — Gs, tbs); 
(Sz, G, G6), (641, — Ge, Os), (06, Os, G:), (s, — $6, G4); 
(So, G10, G10), (G10, 69,69), (Gu, G1, G11), (12, 11, G12); 
(dis, Gis), (Gis, — G18, G16), (bis, — Gis, G13), (G16, — P15, G14); 
(n, — en, — n), (Gis, — dis, is); 


with which we need for the last two, 

(22 Gn (r, q) = 261 (27, ), ois (&, 0) = 262 (2, ), 

which, upon reducing the functions on the right by the transformation of the 
second order to functions of , %, yield Hermite's forms. It may be noted 
that the set yields no further quotients of the same kind under any of these 
transformations. 

The derivation of the expansions (17)-(20) is sufficiently evident from 
Hermite’s detailed similar calculation in the supplement to Lacroix’ Calculus, 
reproduced in Oeuvres, vol. I, pp. 219-220. Hermite’s remarks regarding these 
fundamental developments are so apposite in the present connection that we 
quote them. “Il est impossible de ne pas étre frappé du caractére arith- 
métique de ces expressions ( sin dr, etc.); elles offrent un exemple des fonc- 
tions numériques qui ont été le sujet des belles recherches de M. Liouville, 
et la maniére simple dont elles sont amenées par la théorie des fonctions 
elliptiques peut aisément faire présumer le réle de cette théorie dans l'étude 
des propriétés des nombres.“ It was from precisely this observation that the 
(probably) true origin of Liouville’s general formulas first became evident: 
in fact his first is equivalent to the identity [()]? = Gi (r) X (2). 
Other’ of his formulas (of which we shall omit discussion) follow from equally 
simple identities, such as 


= Go, Where ¢; denotes the x-derivative* of (). 


For the complete set of these in reduced form, ef. Messenger of Mathematics, 
vol. 47 (1917-18), p. 55, where are also given the calculations for § 6. 
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6. Squares of the eighteen o;(2). All may be inferred from the following 
by means of (21), (22). 
(23) (x) L/ — SLV d cos dx), [ Ta]: 
(24) (r) = 4 u, (n) + ese? SLV (d cos 2dr), [Ta]; 
(T) = 14+ SLV (n) + SL ( d cos 2 dx), [ Ta]; 
(26) Sir (r) = — S pi(n) + cot? — ( La cos 2dr), [ 73]. 


For the verification of these, (15), (16) will be found necessary. 

7. In explanation of the unusual forms of these expansions which, if con- 
sidered only for their use in the customary applications of elliptic functions 
are of slight significance, it will be well to state here what are the desirable 
characteristics, from the standpoint of paraphrases, for such series and identi- 
ties between them to possess. Consider for example 


205 (x) bi () () = (22), 


which is easily seen to be true. On the left we have the product of seven 
known series, viz., (r) counted four times, by (), (), (r); 
while on the right we have the product of only four, or, if the transformation 
of the third order be used to give the series for 3 only two. Or, the left 


may clearly be read as the product of six in three ways, each of Gi ¢7, ¢7 G0, 
¢y Gi being known from the derivative ¢;(2). Hence, reading the identity 
in the first way, we shall get a paraphrase connecting Lfunctions of parity 
(01) integrated over two separations of degrees 7 and 4 respectively, or 
7 and 2; in the other ways the degrees are 6, 4 (three times) and5,4. There- 
fore since in these the separations of degree 4 or 2 are the same, we have a 
syzygy between integrations over separations of degrees 4 (or 2), 5, 6, 7. 
Thus it follows that these arithmetical facts connected with the separations 
of degrees 7, 6, 5 (which relate respectively to representations in quadratic 
forms containing 7, 6, 5 indeterminates), may be reduced to others con- 
cerning separations of degrees 4 or 2 (which relate to representations in quater- 
nary or binary quadratic forms), and the theorems corresponding to the 
specializations of the Lfunctions, similarly reduced. This course has obvi- 
ously been followed, for simpler identities, by Liouville in his 17th and 18th 
memoirs; also elsewhere. Our object then is to find, where possible, simple 
expansions for fairly complicated functions, in order to reduce complex arith- 
metical relations to others which are simpler. The greater the complexity of 
the function which is reduced to a simpler product of known series, and itself 
expanded in a power series in q, the greater will be the variety and interest 
of the paraphrases. The most desirable case is the reduction of a product of 
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many theta factors to a single series whose coefficients are given as explicit 
functions of the real divisors of the exponents of q .* 

8. All possible cases of the paraphrase of identities involving tangents, 
cotangents, secants or cosecants are covered by the sixteen formulas following. 
They may be verified by inspection on multiplying throughout by sin x in 
the cases of the cotangents and cosecants, by cos 2 in the others. The suffix 
in u (r), «++, % (r) is even or odd according as the integer in () is even or 
odd; and where necessary to indicate both variables, the functions will be 
written (2n,a), +++, w7(2n,2). 


1 (2n) = sin ese x 


u,(m) =sin mx ese 


½ (2n) = sin An sec x = — 1)’ sin (271 — I) æ, 


n 
P= 


( 
1 
(m—1)/2 
u3(m) =sin mz see x = (—1)% [tan a+2 L (1) sin 


r=1 


n 
ro (2n) = 2nx ese x = ese x - 2 sin (27 — I) æ, 
rl 
(m—1)/2 
n) = mx ese x = cotx —2 D> sin 2rz, 
ral 


12 (2n) = 2nx see x 


=(- 1) | sex 1)" (2 1)2], 


(m—1)/2 
3 (n) = mx see x = (— 1) +2 T (1) cos 


ra) 


n—l 
wo (2n) = sin Au cot x = 1 + cos 2nx + 2 L cos 2rz, 
r=1 


(m—1)/2 


(m) = sin mx cot æ = cos mx +2 cos (27 —1)z, 
r=1l 


we (2n) = sin àAnæ tan æ 


n—1 
= +(—1)" cos 2nx + 2>> (- 1)’ cos 
w3(m) = sin mz tan x 


= (— sec * ＋ 1)? cos mz 


(m—1)/2 
+2 T (—1)*" cos (27 — 
r=1 


In particular much light would be thrown on the arithmetic of quadratic forms in n inde- 
terminates by the corresponding developments of 


(21 + T2 °°: + 2, (21) (22) z (T1), 


n 
= 2 cos — 1)z, 
rel 
(m—1)/2 
aul 2 cos 2, 
721 
(27) 
2895 
—— 
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—1 
cos 2nzx cot x = cot æ - sin 2nx — 2 L sin 2rz, 


r=1 
m—1)/2 


; mx cot x = eser sin mz — 2 >> sin (27 —1)z, 


721 

s 2nz tan 
(30) 
=(- tan a+(—1)"sin ＋ 2 TL (— 


r=l 


16% (n) cos mx tan x 


(m—1)/2 
= sin mz — 2(— 1) (- 1) sin (2r—1)z. 

(28)-(30) are connected by many simple and interesting group relations 

which, as they lie off the main course of this paper, we omit. We may notice, 

however, a theorem of which the proof presents no difficulty, and which often 

either gives additional paraphrases or affords a check on the reductions of 

theta series. Let h(a) denote any one of sec 2, csc x, tan 2, cot x, and A, 

B quantities independent of 2. Suppose that over some separation the 
following is an identity in æ, 


() + B+ Lat (ner) = 0, 
in which ¢ represents either sin or cos. Then this implies 


B+ t (nr) = 0. 


9. We frequently meet expressions involving secants, ete., of several vari- 
ables to be paraphrased, An example will make clear the procedure in all 
such cases. Writing z = 2 + , we have 


sin (mx + 2ny) ese (2 + y) = sin{(m - 2n)a + 2nz}ese 2 
= %(2n, 3) sin (mn — 2n)a + ½ (2n, z) cos (mn — 2n)z, 


and each term is in a form suitable for paraphrase. 
10. Doubly periodic functions of the second kind.* From the standpoint of 
paraphrase, these functions are of the highest importance. They not only 


where a, 6, y, „ 6, are any of the numbers 0, 1, 2,3. The case n = 2 is considered 
below, §11. Note that, as pointed out by Glaisher (Messenger of Mathematics, 
vol. 14 (1884-85), p. 162), these, the arithmetical developments, are unique, while the analytical 
representations are not. 

*The nomenclature is that of Hermite. The doubly periodic functions of the third kind 
also are of great use in this subject. They are particularly valuable in the derivation of new 
and generalized class number relations. As the forms of these functions due to Hermite, 
Biehler, Appel and others can be changed to their arithmetical forms only after many reduc- 
tions, we shall give the appropriate developments elsewhere. 
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include the doubly periodic functions as limiting cases* in one direction and 


give rise to a great variety of doubly periodic functions of the third kind in 

another, but they also afford us the first and most essential link connecting 

paraphrases in which the order of the functions is unity with those in which 

the order exceeds unity. It is of interest to note that equivalents of the series 

given presently can easily be derived from Jacobi’s investigations on the rota- 

tion of a rigid body and were, therefore, most probably familiar to Liouville. 
Consider the function 


Papy (x4 ’ * 2) 91 94 (1 + (21) (22) ,= G8 


and denote by S, G, PI O, (i, j = 1, 2) the results of replacing 2;, x; by 
a; + 2/2, x; + r7r/2 respectively, where 2 has the usual meaning in terms of 
the half-periods; and let P, P, S, P, denote the application of the sub- 
stitutions P., S., P;, P;, «++ in the order last written. Construct the sub- 
stitutions (which do not form a group) c. (i = 1, 2, ---, 16): 


= S2, a3; = S! S2, = 
= §, P,, og = Si Po, = 8.8, Po; 
Si Pi, Pi, = SI Pi; 
Sz Pi Pa, = 8,8, Pi Pa, S, Pi Po; 
and apply them to Gio, giving:T 
$1 = 2 = 203, 
os = bs = 
= = $320; 
os = $i, = 212. 


It may be verified that, disregarding signs, these sixteen functions are all 
those that can be generated from any one of them, ¢;, by successive appli- 
cations of Sr, S2, Pi, Pz. Hence from the series for Gn may be written down 
those for the remaining fifteen, and no others, by this process of transformation. 
The paraphrase interpretation of Sr, Sz has been considered in Part I, § 30; 
that of Pi, Pa is by no means so obvious, and need not detain us here. 


Messenger of Mathematics, vol. 49 (1919-20), p. 81. 

+ The factor + 1 being immaterial for our purpose, it is ignored. The sign is — for j =3, 
10, and for the rest +. In all there are 64 possible functions Sag; the remaining 48 need not 
concern us here. There is an obvious advantage in deriving the functions from one of them, 
Gio (any other of the 16 might have been selected), instead of from two distinct fundamental 
series as done by Hermite. 
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11. Series for the functions of the second kind.* Applying the substitutions 
to dioo, writing = (T, /), 2, =X+ 7/2, y+ 7/2, and de- 
veloping in powers of q, we find after all reductions the following four in which 
the type is 71, 


(31) oi(2,y) = dro = TL sin (dx + dy)], 
(32) (, Y) = = G1 (2, /) = 1) "cos (dx + d)], 
(T, )) = diss = 1 (“, ) 
4 1) % sin (dx + 
(34) (, )) = (T, J)) = ); 
eight in which the type is Ta, 
(35) . (, ) = Gon = ese y + ALL sin (2 dx + /], 
(4, 7) = = (2, %) 
= secy + 4 Dig"l cos (2. dx + %)], 
de (x,y) = = G4 )) 
csc + AL = 1)" sin (2°41 dz + dy)], 
(, /) = dos = (4, y’) 
= see y ( C (-) cos (2 dx + 
(39) Su (, = = 2), 
(40) (r, y) = = (J 2), 
(41) y) = dais = (/, r), 
(42) (4, /) = = o7(y, 2); 


and four in which the type is T3, 


(36) 


(37) 


(38) 


This list being of such importance it was calculated and checked in several ways, to 
eliminate printers’ errors prevalent in other forms in the literature. It was calculated: (i) by 
Hermite's method, Sur quelques applications des fonctions elliptiques (Comptes Rendus, 
vol. 85 (1877), ... 94 (1882), Oeuvres, vol. 3, p. 267; (ii) by applying the o; to Halphen’s 
form of $s (x,y) (Traité, vol. 1, p. 418 (15)), and comparing with the same for ¢: (&, y); 
(iii) by carrying out in detail the calculations in Hermite’s paper Sur une application de la 
théorie des fonctions doublement périodiques de seconde espece, (Annales de l' Ecole 
Normale Supérieure, (3), vol. 2 (1885), p. 303, reprinted with corrections in Oeuvres, 
vol. 4, p. 190). Finally it was compared with Hermite’s final list, Oeuvres, vol. 4, pp. 199-200, 
which still contains an error (in the expansion of Si (, ), as may be verified upon putting 
x, / = 0, 2/2 and comparing with the series for the elliptic functions). Equivalent forms 
for certain members of this list quoted by other writers with an indefinite reference to Kro- 
necker, are unreliable, and should be recalculated before use. 
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(43) (, y) = in = cot + cot y +49" [D> sin 2 (dr + d)]. 
(T, = = s (r, = cota - tan / 
+ ALL (- 1)è sin 2 (dæ + 8%], 
G10 (T, /) = G2 = -- Gs (*“, = tan + tan 
— 4A (= 1) sin 2 (dx + ))]. 
(46) (rt, y) = = () 


(44) 


(45) 


V. THE METHOD OF PARAPHRASE, SIMPLE ILLUSTRATIONS * 


12. As already remarked, lists of formulas in the d, 6 form, such as the 
foregoing, are analogous in paraphrasing to tables of logarithms in common 
arithmetic. It will be evident that by combining the series to form identities 
there is implicit in the lists given an infinity of paraphrases such as those 
exemplified in Part'I; and as a systematic derivation of all the most obvious 
paraphrases is out of the question in a paper of this length, we shall limit the 
illustrations to a few only of the paraphrases lying on the surface, choosing the 
examples partly for their own interest, and partly to show one or two of the 
simpler methods for using such lists of developments. For this purpose we 
may select Lfunctions of degrees 1, 2, and confine our attention principally 
to linear separations, the other cases being treated with equal facility. 

13. By the theory developed in Part I, trigonometric products are always 
to be written in their equivalent sum forms before proceeding to paraphrase. 
We shall accordingly write trigonometric identities derived from elliptic in 
the latter form at once, omitting the intermediate product forms, unless the 
separation into sums is not obvious. 

As a first example, consider the eighteen identities of the form 

0% (x) = X G). 
From (17), (23), 
(T, ) X O (r, F) = Gi (, 
is equivalent to 
16 L/ (>> sin dr) X L (> sin dx) 
u (n) — SLV cos 2 dr], 


where, as in all such cases, the types of division being defined in the lists from 
which the series are transcribed, need not be written; here they are Ti on the 


* At this point it will be advantageous to glance through the Introduction to Part I, attend- 
ing particularly to the bar notation for L-functions and the notation for separations there 
explained, also to the illustrative examples of paraphrases, as we shall not again refer to any 
of the notation. See also the references at the end of Part I. We shall cite Liouville’s the- 
orems by giving the numbers of the memoir, page and formula, thus: Liouville (1), 144, (A). 
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left, 72 on the right. On referring to §3 (J) for the d-form of ur (n), and 
equating coefficients of , we get 


2n = m + m“; n= n; m, m', m“ = db, d' 5’, d“ ò“: 


Leos (d’ — d“) x — cos (d 1 — cos 2 dr]; 


and this, on writing f() = f(a), paraphrases into 
2n = m' ＋ m“; n = 2 m; m, m', m“ = db, dd“, d“ 5’: 
(J 


Df (d’ — d“) —f(d' + d”)] = 2* — d)]; 


which is Liouville’s (2), 194, (a), and for a = 0 his first formula, (1), 144, (A). 
It is of course not necessary in any such case as this to replace the numerical 
functions p(n), ete., by their d-forms; but doing so increases the symmetry 
of the paraphrases. For simplicity in writing we shall henceforth put 


f(2|) 
In (J each argument is even. Hence f(a) may be replaced by f(2/2), 
giving 


- = 2 Dar 


In this case no material simplification is thus effected. But when the con- 
trary is the case we shall make the change without notice. A similar trans- 
formation of a paraphrase involving odd arguments is not permissible, since, 
by the definition of an L-function, f (m/2) does not necessarily exist. 

14. The use of (27)-(30) will be sufficiently clear from the following, which 
we give in some detail as it illustrates transformations of several types which 
occur frequently. Any identity involving esc, tan or cot might be chosen; 
we take (a2) X (x) = (), and use (18), (24), getting 
[ese x + AL (> sin dr) 2 = (n) + ese? a — (d cos 2dr). 
Equating coefficients of g", where n = 2*m, a > O, we have for the separa- 
tions n/2 = di 6,, and 
90’ 9a” 


n nn“; 


m, m', m“ = dé, d' 6’, d“ òͤ“, 
the following 


> sin dx ese x + cos (d' — d“) x - cos (d' d“ 
= u (n) — cos 2d, 2. 


Since d is odd, (27) gives for = sin dx ese x the value Du, (d, x), or 
Trans. Am. Math. Soc. 14 
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(d—1)/2 
+2 >) cos are |, 


r=1 


= + 2 cos 2x + cos 4a + cos 6 + --- + cos (d- 1) 4], 


9 


where we have taken ¢’ from §3. We may eliminate the separation n/2 


di oͤr, by the following obvious identity 


Lai cos 201 * = d cos 2d + 2d cos 22 dx . + 2 d cos 24 dr], 


which results upon segregating the odd divisors d and (when a > 1) the even 
divisors 2° d(0 < 8 = a — 1) among all the divisors di of 27? m. Hence, 
substituting the sums thus found for the respective terms of the original sin, 
cos, ese identity, and paraphrasing, we have 


n = 2%m = 2° m’ + 20” m“; a> O; m, m, m = d, d', d“ 3”: 
a’) — f(d + a”) (0) 
(II) — (f(2) + £(4) +f(6) 1)! 
— Lal, (2d) + 2f( 2d) T. . 1247 (2 d) l. 


The subcases of all paraphrases such as (II) in which a = 0 (although this 
does not directly come under (I), a being > 0 therein, the paraphrases are 
closely related, both being consequences of the &: () identity), are of great 
importance in connection with the representations of primes p in the form 
ax? + hre e, where a, h are constants and r is prime, in that the specialized 
forms of these subcases for f(a) = 1, 2°, . give rise to identities of the 
forms considered by Bouniakowsky and Liouville as the point of departure 
for determining the number of such representations. In the present instance, 
the sum Sq" (Xd; cos 2d; r) can contribute nothing to the coefficient of %; 
hence the second © on the right of (II) is absent. Again, for a = 0, we have 
m = 2% m’ + 2°” m". Hence one and only one of a’, 4“ = 0; and therefore 
the value of =[f(d’ - d“) (d “)] over the separation n = 2” m 
+ 2°" m” is the sum of its values over the (identical) separations 


m = m + 2%" %“, m = 2° ＋ m“. 
Finally then we have, on referring to § 3 (4) for the numerical functions, 
m = 2% m’ ＋ m“; m, m', m. = dé, d' d“ 8“: 
(I) [f(d' - d“) — 
— 2D 1f(2) + f(4) + +f(d 1)]. 


This is Liouville’s (3), 201, (D). We have not yet exhausted the obvious 
consequences of n = e X G:; any identity involving ese æ gives at once two 
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paraphrases, one similarly to (II), and the other by paraphrasing the result 
of multiplying the identity throughout by sin x to eliminate ese x. In the 
present case we get, on equating coefficients of gq", n = 2*m, a > 0, as in 
proving (II): 
> [sin (1 +d’ — d“) (1 d d“ 

＋ sin (- IAA“) sin Ad“)! 


= ui (n) sina 2, sin dx 


— aden (21d ＋ — sin (27 d — |, 


ras} 


which, on writing g(x) =f (|), gives the paraphrase 


n = 2°m = 2% m + 2 „“; a O; m, m', m“ = dò, d's’, d“ ò“: 
(IV) —g(1 +d’ +d")] =m(n)g(1) — 


- 40902 d +1) — g(2"d —1)} |. 


The diversity in form of (II), (IV), is the more striking in that (IV) is merely 
the paraphrase of the same elliptic identity as that which gives rise to (II) 
when it is multiplied throughout by sin x. For identities involving ese 2 or 
cot 2 we thus get the paraphrases corresponding to (IV) on multiplying first 
by sin æ; for those containing sec æ or tan æ we first multiply throughout by 
cos x. Similarly to (IV), corresponding to (III), we get, 


m = 2° ＋ m“; m, m', n.“ = dé, d' 5’, d“ òͤ“: 
(V) ＋ d! d“ “ d“) 
—g(1+d'+d")] = Tg (d), 


which is Liouville (3), 206, (E). He remarks that (III), (V) are ultimately 
the same thing, which is obvious from their origin. His (F), ibid., p. 208, is a 
paraphrase (among others) of Jacobi's Z? (uw), Fundamenta Nova, § 47, equa- 
tion (1); also his (4), 242, (G) is from the same source, or it follows from one 
of the formulas for functions f (2, /) given below in (XVII). 

15. Although their arithmetical consequences are often widely different, at 
least in appearance, we shall regard paraphrases that may be derived from 
one another by means of the elementary transformations of Part I (III) as 
equivalent. We shall now show how the simplest properties of the elliptic 
or theta functions such as those in (21) are of direct use in finding all the 
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distinct and equivalent paraphrases implicit in a complete set of identities of a 
given kind, here (r) X = (). 

Obviously Gu X n) = ir () gives nothing distinct from (I), as 
may be seen on glancing at (22). In the same way we dispose of j = 13, 14, 
IS when the paraphrase for = 11 is known; and (21), (22) show that for the 
following pairs of values of j the ¢; = , X & paraphrases will be identical, 


(1,5), (2,6), (3,7), (4,8), (9, 10), (13, 15), (14, 16), 


since in each case the resulting elliptic identities may be transformed into one 
another by changing the sign of g. And from the same source it is seen that 
the paraphrases corresponding to the next pairs will be equivalent in the sense 
that either in a given pair may be transformed into the other by one of the 
elementary transformations considered in Part I (III), 


(1,3), (2,4), (5,7), (6,8), (9,10), (11,12), (13, 14), (15, 16). 


Hence we shall find all the required paraphrases by taking 7 = 1, 2, 3, 4, 9, 
11, 12. The cases j = 1, 2 give the paraphrases (I)-(V); omitting the 
alternative forms that correspond to (IV), (V) we get the following in the 
same way for = 3,4,9, 11, 12. 


2n = m +m"; n = 2° m; m, m', n“ = db, d’ 5’, d“ 6”: 
— 1) te? ff — + f(d' 
(VI) = 2 1)" (2d) £(0)]; 
— d d + + 
= — 2 +f (2% d)). 


These are identical; the first comes from j = 3, the second from j = 7. For 
j = 4 we use 2; from (28), getting 


n = 2%m = 2° m’ + 2" O; 
* m, = do, d' 8’, d“ ò“: 
Lle 1) 2 {f(d’ d ＋ (d- +d’)}] 
(VID! 
= [f"(n) 3m 
—f(4) +F(6) + (- 1% 1)] 
+ La- f (2d) + 2f (22d) + + d)]; 


and the related form 


we 
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m = 2" m' ＋ m“; m, m', m“ = db, d' 5’, d“ ò“: 
2LI( - — d.) + ＋ d)]! 
n) - 
— 27/02) —f(4) + %% 1)]. 


For j = 9 we have 


(VIII) 


2 m = 2° m’ + 2 a 
m, m', m“ = do, d' 5’, d“ òͤ“: 
(IX) d — 27d“) ＋5(2 d' + 2°" dj] 
= [1 (27? m) — (0) 
+ -) — 2 (Z d). 


In this the f (2) which appears upon first paraphrasing has been replaced, as 
allowable, by f (2/2). 


m = 2 +m"; m, m“ = dé, 6’, d“ ò“; a O: 
(X) 2>> ( wi {f (2° + f(2” d. ＋ — 
(d). 


For j = 11, using wo from (29), we find 


n n' n, u', = d, d', d“ 3”: 
1)**8" {f(d’ = d +. d")}] 
(XI) =[p(n) — pi(n)]f(0) — [d+ (- f(d) 


— (g= 1) (f(1) +f(2) + f(d 1)]. 
The special case in which n = m is of interest: 
(XII) = [f(m) {i(m)]f(0) (d- 1) 
+ 201f(1) +f(2) + +f(d -1)). 


The similarly derived paraphrases of the derivatives ¢; = ¢7 G, etc., and 
those for the relations between pairs of functions whose products are constants, 
also yield simple and interesting results, but to keep this paper within reason- 
able limits we pass on to a very brief consideration of the more important 
series (31)—(46). 

16. Let us first paraphrase one of the obvious identities suggested by the 
form of &s (, )) in (43), as the resulting paraphrase is one of those which 
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Pepin ((1), 84-92) proved by Dirichlet’s method. By the addition theorems 
for the 3-functions (Jacobi, Werke, vol. 1, p. 510, (C)), we have 


and hence by (43), (18), on multiplying this identity throughout by 
(x) d(x), 
we get 


(2, (, —y) = () (K). 


Substituting in this the respective series as given by (43), (24), replacing q by 
V, and equating coefficients of 9, we find 


n = +n"; n, n', n“ = d, d' d“ &“: 
l cot x sin 2dx cos 28% — cot y cos 2dz sin 20% 
+ do [cos 2{(d’ d“ + (8 + 8”) y} 
— cos 2{(d’ + d“) + (8 — 8”) y}] = Lad cos 2dy — cos 2dæ]. 


By (29) the first sum* 


d—1 


= 3 [cos 2dy — cos 2dx] + 2 | = cos 2rx cos 26y — cos 26x cos 2ry} |. 


Considering the second sum (on the left), we havef 
>> sin 2(d’ — d“) æ sin 2(8 + 6”)y =0 

> sin 2(d’ + d“) sin 2(8’ — 8” )y, 


the d being identical in reversed order with the d“, and similarly for the 6’ , 6”; 


hence the second sum reduces to 
>> [cos 2(d’ — d“) x cos 2 + 6”) = cos 2 (d“ + cos 2(8’ — 8”) JJ. 


Making all these reductions in the original cot, sin, cos identity, replacing 
then x, / by x/2, y/2 and paraphrasing, we get 


d, 6 may clearly be interchanged in either term under the 2: 

(2d, x) cos 28% — wo (28, /) cos 2dr] = (2d, æ) cos 20% — wo (2d, /) cos 
We have made this change before writing out the = by (29) in the next step; henceforth it 
will be unnecessary to point out similar transformations. 

+ The reduction in this step may be obviated by paraphrasing the right of 
— 4% (, (T, —y) = Los(z,y) — (r, les (2, + - E, 
instead of, as above, the left. Such devices sometimes avoid complicated arithmetical reduc- 


tions. 


8 
at 
4 
& 
ay 
Ae 


ARITHMETICAL PARAPHRASES (II) 
n = aw, = db, d' 6’, d“ 6”; 
=f (2, y): 
(XIID - d“, o + 6”) —f(d’+d , 6’ — 
—1){f(0,d) —f(d,0)}] 
[ 
r=l1 
Putting f(z, )) =f (2x) or f(y) in (XIII), we find at once 
n=n'+n": n,n’, dd, d', d“ 3”: 
- d“) d“)! 
(n) — (n) If (o) + (26 d- 1)f(d) 
— 2 ((I) +f(d -1)). 

Henceforth we shall write f (, y\) 5). 

17. By a simple transformation (XIII), (XIV) take more elegant forms. 
We remark, however, that although slightly simpler in appearance, the new 
forms are in reality not so simple, containing redundant terms; we give the 
transformation merely to show the identity of the forms above with Liouville’s. 

For n = dé, let T' TLF (ö, r)] denote the result of deleting from 
LI Vwi f (6, r)] every f (ö, r) for which r is a divisor of d, and similarly 
for f(r, ö) ], D’[ f(r)]. Then it is easily seen that 

11 d—1 d—1 

0, ) — f(r, 6)} | Ste, — 5) |, 
[f(2) + (3) + +f(d -1)] 

Hence (XIII), (XIV) over the same separations may be written, 


(d’ 6’ 4. 8“) — (d! +d”, 6’ 5’) 
(XIV’) — (o) +d — 26 — 11, (d) 
— 25° 1f(2) + £(3) +++ +f (d 1)); 


which are respectively Liouville’s (5), 284, (f) and (4), 247, (H). 


(XIV) 


2 
7 
x 
7 
2 
— 
1 
a 
9 
5 
at 
Z 
‘ 
i 
5: 
J . 
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18. To generalize a paraphrase that arose from , we seek ¢;(2,y), 
G (r, /) such that 
+ (, —y) = (r) + % (0. 


Thus for j = 7, we have the identity (from (31), and the application of the 
transformations indicated in the first triads of (21) upon (17)) 


61 (, (, —y) = — (y), 

where we have used 
Paraphrasing the ¢-identity as in § 16, we find immediately 
2n = m' ＋ m“; n n; m, m', m“ = db, d' 5’, d“ ò“: 
(XV) — d“, + 6") +d", — 
= 2° (0, 24) d, 0)], 
which is Liouville (2), 199, (b), (e), and which becomes (I) for 
=f(%), fly). 
Since by (33), G (, )) = — 1 (4, ), the paraphrase of 

63 (2, (T, — y) = 63 (x) 6800 
is (XV). From (32), (21) we get 

(T, ) (T, —y) = — G5 (y), 


whose paraphrase may be written down from (XV) by means of the elementary 
transformations of Part I, $30, on observing that G: (, )) = O (r, ). 
Over the same separation as (XV) it is 
[ ( 4 6’ = 5’’) +f(d’ 6’ 8% 
(XVI 
= 2° ( d, 0) — (—1)"f (0, 2 d)]. 


On putting f(x,y) f() in this it becomes (VI), second form; for 
f(x,y) =f (y) it is the first form of (VI). Continuing thus with the obvious 
consequences of the developments in § 11, we find from 


95 0% ( + y) — y) = r), 


— G. (, /) (, - ) = (y) + (2), 


on using uv; (ö, /) from (27), the following paraphrase: 
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n=n'+n’"; n,n’, = 2 n, 2 m', 2 m“; 
m, m', m“ = dò, d' 5’, d“ 6”: 
> [f (2% d’ — 2 d“, + 8") d! + 2“ d“, — 8“) 
(XVII) (-I) (o, 2d) + 2d (0, 2d) T 
+ 21 df(0, 2*d)] — >> (2*d — 1) f(2*d, 0) 
+ 2 (2d, 2) +f 4) + f(2*d, 6) 
+f(2*d,6—-1)]. 


For f(x,y) = f(y) this becomes (II), (III); for f(x,y) = f(a) we find 
after a simple reduction, 


2 m = 2° m’ + 2°" m,; m, m', m“ = dé, d' 6’, d'“ 8"; 
a> 0: 
Cis (ed 2 0) 
2°d)[f(2*d) — f(0)]; 


(XVIII 


and when a = 0, 
m = 2° m' +m": m, m', m = db, d' d“ 8“: 
2 - d) d — d)f (a). 
These three are Liouville's (5), 280, (e); (3), 208, (F); (4), 242, (6). Simi- 
larly from (36) we have 
(T, 7) (r, —y) = O3(y) + (), 


which gives upon reduction by 73 (6, y) from (28), 


(XIX) 


n=n'+n"; nu, n', n“ = 2 m, 2% m, 2“ m; 
m, m', m“ = dé, d’ 5’, d' ò“: 
Dee (2. d, di, — 8 
+f d- ＋ 8,0] 
= (2*d — 17 (2d, o) + 2D [f 2) 2d, 4) 
(— F(2¢d, — 1)] 
— 2d) + 2f(0, 2d) 
27 (0, 24], 
which might have been derived by the elementary transformations from 


(XVII); as it is the transformation affords a check of a kind which frequently 
is valuable. For f (, y) =f (2) this is easily seen to be 


(XX) 


j 

¢ 

{ 
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LI + 2“ + f (2% d' — 2°" d"’)}] 
(XX) (— 1)"}(3 — 20 (0) 
+ D + (— 


over the same separation as the preceding. This is essentially two theorems; 


the first corresponds to a = 0, and is (X); the second is, with a > 0, 
＋ 28" +f (2° 2 d)] 
2˙) + + (— d). 
This is Liouville (5), 282, (K). The examples which we have given are 
probably sufficient as illustrations of the simplest methods of using the tables, 
and the length of this paper forbids more systematic treatment here of the 
lists sampled or of the numerous other sets in the theory of elliptic functions 


which Liouville apparently did not touch. Also we must Jeave aside the 
many interesting questions that suggest themselves; e.g., the classification of 


(XXII) 


the paraphrases, the inverse problem of finding paraphrases for a given separa- 
tion (in which the theory of the transformation of elliptic functions finds a 
new application to arithmetic), and the passage from paraphrases for functions 
of degree 2 to degree exceeding 2. One natural starting point for the last is 
Jacobi's (and II. J. S. Smith's) formula for the multiplication of four theta 
functions, and its consequences for % (r + y +2), ete. This generalization 
is of importance, as it leads naturally to the general case of such paraphrases 
as the present, viz., to the paraphrase of Riemann’s theta formula and the 
theory of the related functions, making possible a ready arithmetical inter- 
pretation of some of the most striking analytical results in the theory of abelian 
functions.* 

In the present order of ideas the series for the ¢ (2), n > 2 furnish inter- 
esting results; e.g., O (a) gives the principal theorems in Liouville’s sixth 
memoir. We may conclude with two paraphrases for quadratic separations, 
to illustrate the fact that there is no increase in difficulty by these methods 
when we pass from linear separations to quadratic. This certainly is not the 
case in methods used hitherto. 

19. From the identity 

(2x) 


and the following, which is easily derived from the known series, 


* This generalization was carried out in some detail for the hyperelliptic functions of the 
first order, and in particular for the transformation theory of such functions, in 1915; the 
results, which lead to interesting arithmetical conclusions, will be published as soon as the 
papers on the elliptic case have appeared. There is a notable gain in generality when we pass 
beyond the elliptic case: the majority of the paraphrases refer to functions of n variables 
unrestricted in any way whatever; parity no longer plays an essential part. 
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3; (1) / (x) = cot æ + 4 L (sin 2dr), [Ta], 


we find upon replacing q by , writing (n) = 1 or 0 according as n is or is 
not the square of an integer > 0, and paraphrasing as usual; 


m = 8n’ + m“; n’ = 


2L (= ο - m) — f(2d’ + “)] 


= (vm —1)f (vm) - 2f(1) — 2f (3) 
— — 2f(vm — 2)]. 
Liouville gave several of the same kind, but not this. He does not seem to 


have used the 3, (4/0, (2). 

20. One of the simplest methods for finding quadratic paraphrases con- 
cerning functions of order > 1 is by proceeding from identities between theta 
functions (not their quotients) and series whose d, 6 forms are known. Thus i 
it is seen by inspection of (35) that 


00 „) - , (r /) = O0, 


where we have not gone beyond two variables in order to keep the writing 

simple. Substituting the series for d ( + y), ( NY, + y), and noting 

that 00 (2) may also be written 2 ( 1) % cos 2n; x, where the range of 
ui is — © to , we find after some simple reductions the following form 
of the d, & identity: 


1 —1 
Se — 1)" sin 2nx Ecos (21 — 


Il 


0; 


N11 


+2 gi b (—1)" sin (2 cos (2 oz 


and this being an identity in , we have the next an identity in æ&, y, (the 
separation in both cases is the same); 


n= ni + no; 11 =0; O < ng = 2% mo; mz = do be: 


an 


e (n) (- 1)" sin Le cos (2r 1) 


121 


+ (— 1)" sin (2 + 2) a cos (2"** dz — 62 + 281) = 0; 


and thence, over the same separation, fl 


( 1)" f(2="" ds + 2n, | 2* d + ny ) 
XXIV 
= e(n)(—1)""* Def (2r vn). 
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ON THE ZEROS OF SOLUTIONS OF HOMOGENEOUS LINEAR 
DIFFERENTIAL EQUATIONS* 
CLARENCE N. REYNOLDS, JR. 


1. INTRODUCTION 


In this paper I shall first prove a general separation theorem for the zeros 
of solutions of the general homogeneous linear differential equation. I shall 
then generalize Birkhoff's theorems of oscillation and comparison for equations 
of the third orderf by proving two series of theorems, one for equations of 
odd order and one for equations of even order. This latter series of theorems 
is then applied to the study of the zeros of the solutions of self-adjoint linear 
homogeneous differential equations of the fourth order. It would be sufficient 
for my purposes if I were to assume that the coefficients and solutions of the 
equations considered, together with a sufficient number of their derivatives, 
were defined and continuous for all values of the independent variable con- 
sidered. In order to simplify the statements of my results I shall assume 
these coefficients and solutions analytic. 

We may, without loss of generality, take our nth order equation in the 
form 


(1) D. =0 (aS Sb) 
and assume that the wronskian of any fundamental system of those solutions 
of (1) whose zeros we are studying is identically equal to one; i. e., 
71, 


The equation adjoint to (1) is 


(2) 2 + (- 1) = 0. 


122 


Presented to the Society, September, 1918, and September, 1919. 
t Birkhoff, On the solutions of ordinary linear homogeneous differential equations of the third 
order, Annals of Mathematics, ser. 2, vol. 12 (1911), pp. 103-127. 
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This equation is satisfied by 2 (), the cofactor of / () in W(2), 
= 1. 2, % . By the properties of z;(2) as a cofactor, 


11 


Ly (2) = 0 (x 0, 1, 


t=] 
n 


LV (r) (x) 


121 


Throughout this paper I shall denote determinants of the form 
ys cee 


(ke) 


(ke) (ke) 
91 Yr Yn 


yi») 
by the symbol: 
(Kp 
L Yn 9 
2. A GENERAL SEPARATION THEOREM 


THEOREM 1. Jf x, and 22 are consecutive zeros of yi: (x), and if ½ () does 
not vanish for either of these values of x, then the number of zeros of y (x) between 
a, and æ plus the number of zeros of (V, y2) in the same interval is odd. 

If for definiteness we let 


yi(x) >0 (a (), 
then 


71 (41) = yi (21) (a1) = 0, yi? (21) > 0, 
71 (42) = yi = O, (— 1) y\ (a2) > 0, 


where J en and k<n since W(x) +0. Differentiating the function 
f(x) = y(n, y2) repeatedly, we find that 


fem (21) m<j-1, 
(x2) = O, m<k—1, 
(i) = - (a1) () < 0, 
= — (= (a2) (a2) < 0, 
(10 (x) > 0. 


Hence the number of zeros of f () between x; and x2 is odd. Therefore the 
number of zeros of y (r) plus the number of zeros of (yi, y2) between x 
and is odd. 

Theorem 1 may be geometrically interpreted by supposing the numbers 
(), +++, Yn () to be the homogeneous coördinates of a point on 


ꝙ—92—ñͥͤ —c— t: (— 221 | 
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an analytic curve in space of n — 1 dimensions. This curve is the integral 
curve of equation (1). The condition W(2) + 0 then means that the oscu- 
lating (u — 2)-way plane spread is never stationary. The determinants 
(yi, %) (4,7 =, 2, „ ), — 1)/2 inn umber, are the homogeneous 
line coördinates of the tangent to the curve at the point (y1, ye, „ ) 
The vanishing of (%, y;) at a point on the curve means that the tangent at 
the point meets the (n — 3)-way plane intersection of the (n — 2)-way 
plane spreads, whose equations are y; = 0 and y2 = 0. 

Hence Theorem 1 may be read as follows: 

If the integral curve of equation (1) does not meet the (n — 3)-way plane 
spread whose equations are /1 = 0, y2 = 0, then between two intersections 
of the curve with the (n — 2)-way plane spread whose equation is y; = 0, 
there are an odd number of intersections with the (n — 2)-way plane spread 
whose equation is y2 = 0, and points of tangency with elements of the pencil 
of (n — 2)-way plane spreads whose equation is ay; + By2 = 0, where 
a and g are parameters. 

For equations of the second order this theorem is equivalent to Sturm’s 


theorem“ that the zeros of two linearly independent solutions alternate, since 
(v1, %½) + Oin this case. For equations of the third order it is equivalent to 
Birkhoff's general separation theorem. 


3. REGULAR INTERVALS 
Let 


n 


(5) (2 (E) yi (2) 


i=l 
then from (3) and (4) we have 


k 
(6) ( x (* 0, 1, 25 „ 2), 


and 


(7) 


(r, 8) 1, 


From (6) and (7) we can prove by substitution that if j (4) is a function satis- 
fying 
8) n(x) — f o(x, 
a 
where / () is an arbitrary solution of (1) and R(2) is an arbitrary analytic 
function of x, then 7 (2) is a solution of the linear differential system 
*Cf. C. Sturm, Journal de mathématiques pures et appliquées, 


vol. 1 (1836), p. 131. 
t Cf. Birkhoff, loc. cit., p. 109. 


ͤ 
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(9) 1 + * pin i + Ryn = O, 


(10) n® (a) = y™ (a) 
Interchanging the roles of (1) and (9) we may define 


(5) v(x, 5) 


where Y (), (i = 1, 2, % n), is a fundamental system of solutions of 
(9) and the ¢’s are derived from the 's as the z’s were derived from the y’s. 
Then 


(6’) 


37, S) O, 1,2, 2), 


7) (2, 1, 


and 


(8) y(x) = n(x) + f 


Now, if we set / (r) = (, q) in (8), we have 


since Y (, a) (, a), = E, (I= 0, 1, 2, mn — 1), by (6), 
(7), (67), and (7). It will now be possible to compare solutions of (1) and 
(9) in intervals which satisfy the following 
Definition. An interval (a S S) shall be said to be regular with 
respect to (1) and of the first [or second] kind whenever the following condi- 
tions are fulfilled, as 2 and E vary throughout the interval: 
Kind of Regular Interval n Odd n Even 
Second 62, ) a0, a2 4 €) <0, 2 < &. 


The proof of the following theorem does not differ essentially from the proof 
of the special case for the equation of the third order, which Birkhoff has 
given.* For this reason I do not give the proof here. 

THEOREM 2. If an interval is a regular interval of the first [second] kind 
for an equation (1) of odd order, then it is also a regular interval of the same kind 
for the equation (9), provided that the inequality R(x) = OH (A) = 0] obtains 
throughout the interval. 

THEOREM 3. Any regular interval of either kind for an equation (1) of even 


* Loc. cit., p. 117. 


' 
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order throughout which It () S 0 is also a regular interval of the same kind for 
equation (9). ° 

If a is any number in our interval, then, by (6’) and (7’), V (2, a) is posi- 
tive if x is in the immediate right-hand neighborhood of a. If y(2,a)=0 
for any value of x greater than a, then, since ¥(2, g) is continuous, there 
exists a number, 29, greater than a, such that (, 
< 2’ < ae). 

If our interval is regular and of the first kind for (I), then & (, &) > 0, 
( S SJ). If we now let R(x) = 0 throughout our interval and sub- 
stitute ay for x in (11), we have 


0 = G (o, E) R(S)V(E, ) dE, 


or zero equal to the sum of a positive quantity and an integral which cannot 
be negative. Therefore the assumption that V (, a) S 0 for any value of x 
greater than q leads to an absurdity. Hence W (, a) > 0 for x > a, where 
a is any value of & in our interval, and our interval is regular and of the first 
kind for the equation (9). 

Similarly our theorem may be proven for regular intervals of the second 
kind. 

Tukonku 4. Jf (a Sab) is a regular interval of the first [second] kind, 
for an equation (9) of odd order, throughout which R(x) <0 [R(x) > 0], 
except for at most a finite number of zeros, and if y(x) and r) are non- 
identically vanishing solutions of (1) and (9), respectively, such that 


y™ (a) (a), a Sb (k=0,1,---,n-1), 


then between a and the least |greatest| zero of n(x) [y (x) ] which is greater [less] 
than a, there exists at least one zero of (r) l)] at which y(x)[n(x) } 
changes sign. 

For definiteness, let the first of the numbers 00 (a) which does not vanish 
be positive. Then, in the immediate right-hand neighborhood of x = a, 
both y (r) and y(x) are positive. Now, let @ be such that (F) = O, 
n (r) YO (aS OA). Then, substituting x = & in (8’), we have 


Now, if our interval is regular and of the first kind, Y (F, S) = 0(# > &) 
and H (F) <0 (a SSS), except for at most a finite number of zeros. 
Hence, if we suppose that / (x) does not change sign between a and , we have 
/ (E) 20(a@ SSE anda non-negative quantity y equal to an integral 


that must be negative, which is absurd. Therefore, / (r) changes sign at 
least once between q and . 


4 
q 
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If our interval is regular and of the second kind, a similar proof will hold. 

If we change the sign of R (), we interchange y (x) and () in the con- 
clusion. 

THeorEM 5. If (a Sr Sb) is a regular interval of the first [second] kind, 
for an equation (9) of even order, throughout which R(x) < 0, except for at most a 
finite number of zeros, and if y(x) and n(x) are non identically vanishing solu- 
tions of (1) and (9), respectively, such that 


y™ (a) = 9 (a), (a U) 
then between a and the least [greatest] zero n ), which is greater [less] 
than a, there exists at least one zero of () [n(a)] at which C l)! 
changes sign. 

A proof similar to that of Theorem 4 proves this theorem, and, as in 
Theorem 4, a change in sign of R(x) interchanges / () and y(2) in our 
conclusion. 

4. SELF-ADJOINT EQUATIONS OF THE FOURTH ORDER 


The general self-adjoint linear homogeneous differential equation of the 
fourth order, with the coefficient of the first term identically equal to 1, can 
be written in the form 


(12) + 10 U“ + 10% + (357 + 9p} + R)n = 0, 


where pz () is not the function used in (1). Here the form of the coefficient 
of ij is not dictated by the condition of self-adjointness, but is chosen for the 
purpose of relating equation (12) to the equation 


(13) + 10 + 1002 / + (357 + 9p?) y = 0, 
which is satisfied by the cube of any solution of* 
(14) u’+pu=0. 


Furthermore we know that if y () is any solution of (13) then 7 (x) as defined 
by (S) satisfies (12). If G2 (r, &) is the solution of (14) defined by (5), thenf 


0 
62 (, f) O, 5762 (, f) 1, 4 k, 


and 


02 


Ct. Brioschi, Acta Mathematica, vol. 14 (1890), p. 236. 
7 If ur (r) and uz (æ) are two solutions of (14) for which (ur, u“) = 1, then 


(T, f) = — u (F) un () T (E) uz (2). 
Trans. Am. Math. Soc. 15 
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Therefore if G (a, f) is the solution of (13) defined by (5), then 


6% (T, &) = 30 (2, E). 


Hence by the definition of regular intervals any interval which is regular for 
(14) will be regular for (13) and conversely. Therefore we can substitute 
(14) for (1) and (12) for (9) in Theorem 3 and obtain 

THEOREM 6. Any regular interval for (14) is a regular interval of both kinds 
for (12) provided that N () S O throughout our interval. 

Similarly we may substitute (13) for (1), (14) for the first (9) and (12) for 
the second (9) in Theorem 5 and obtain 

TueorEeM 7. Jf (Ei Sx Ez) is a regular interval for (14) throughout 
which R(x) < 0 except for at most a finite number of zeros, and if y(x) and 
n (a) are non-identically vanishing solutions of (13) and (12) respectively such 
that %% (a) (a), Ei S a S kz, (k =0,1,2,3), then between a and 
the least | greatest | zero of n(x) [y(2)] which is greater [less] than q there exists 
at least one zero of [n(x)] at which y (x) [n(2)] changes sign. 

In applying these two theorems it is to be noted that sometimes the regu- 
lar intervals for equation (14) are bounded by two consecutive zeros of a 
solution of (14), but that if (14) has non-oscillatory solutions then they are 
not so bounded. 

Definitions. The forward interval of oscillation at x = a for a given equation 
is the least interval (a, 8) such that all solutions vanishing for x = a will vanish 
again in (a, 8). 

The backward interval of oscillation at x = a@ is the least interval (B, ) such 
that all solutions vanishing for x = a will vanish again in (8, a). 

Tukokku S. If R(x) < O except for at most a finite number of zeros and if 
equation (14) possesses a backward interval of oscillation at x = a, then equation 
(12) possesses a backward interval of oscillation at x = a which is not greater 
than the backward interval of oscillation for equation (14). 

Any solution of (13), being a homogeneous binary form of the third degree 
with real constant coefficients in any pair of linearly independent solutions of 
(14), has at least one real linear factor. This factor vanishes once and only 
once in the interval (2; S x < a) where 2 is the zero of G: (, a) which 
immediately precedes a. Therefore, equation (13) possesses a backward 
interval of oscillation at x = a. Since o (, a) satisfies (13), this interval 
is equal to the backward interval of oscillation for (14). 

By the preceding theorem any solution, 7 (2), of (12) which vanishes for 
x = a, vanishes and changes sign at least once between a and the greatest 
zero less than q of that solution, y (2), of (13) which satisfies the conditions 
(10). Therefore, 7 (x) vanishes and changes sign at least once in the interval 
(a1 = a). Of all the zeros, infinite in number, immediately preceding a, 
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of the various solutions of (12) which vanish at x = a there must be a lower 
limiting value, &, which is not less than x; , such that all solutions of (12) which 
vanish for x = d vanish again in the interval (F Sr (a). This interval 
is the required interval of oscillation. 
5. A SEPARATION THEOREM FOR SELF-ADJOINT EQUATIONS OF THE 
FOURTH ORDER 

If u, (r) and 5 (&) satisfy equation (12) then we can prove by differ- 
entiating or by Lagrange’s identity* that 
(15) P (ni, 1j) = (1, 05°) — (1, + 10p2 (U, 05) 
is a constant. 

Again it is well known that the six functions 

(ni, 77) (i,j =1,2,3,4,i <j) 
satisfy a linear differential equation of the fifth order.t Therefore, they 
must be linearly dependent. If we expand the following linear combination 
of identically vanishing determinants, 
(16) (m, 12“, 03,4) — (m, m2, 23,11) + 1052 m2, 23,1) = 0, 
in terms of the two-rowed determinants of the first two rows we have the linear 
relation 
17) P(m, 12) (ns, 1) — P(m, 13) (m2, 14) + P(m, 04) (m2, 03) 

(47 
＋ P (u, 13) (U, 1) (u, 04) (U, 23) +P (As, 04) (m, 12) =O. 


Furthermore, it can be shown by actual expansion of terms that 

(18) = P(m, 03) + P (ms, 04) 02 + P 12) 08, 

the other ¢’s being obtained by cyclic permutations of the subscripts. Sub- 
stituting Fi in (17) we have 


72, 73, 74 


(19) (m, oi) + N2> N35 1 0, 
P(m, 12), P (ui, 73), P (ur, 4) 


where our three- rowed determinant does not involve 7; (2) explicitly. Solving 
for (r, Fi) and substituting in Theorem 1, we have a general separation 
theorem for self-adjoint equations of the fourth order. 

THeorEM 9. If ni (x) (i =1,2,3,4) are linearly independent solutions 
of (12) then between two consecutive zeros of n, (x) at which 


Cf. Bocher, Legons sur les Méthodes de Sturm, Paris, 1917, p. 23. 
7 Cf. Forsyth, Philosophical Transactions, vol. 179 (1888), p. 455. 
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di () [= — (1, 03, 24 )] 


does not vanish, there exist an odd number of zeros Fi (r) and 


73, 74 
| 52, N35 
P (ur, 12), P (u, 13), P (ur, 74) 


Interpreting this theorem geometrically by means of the curve whose 
parametric equations are y; = 7:(2) (4 = 1, 2, 3, 4) and the conical pro- 
jection of this curve upon the plane /1 = 0 from the vertex (1,0,0,0) we 
see that between two consecutive intersections of our curve with the plane 
yi = 0 at which the projected curve has no point of inflection there will exist 
on the projected curve an odd number of points of inflection and points at 
which the tangent to the curve passes through the point [0, P(m, m2), 
P(m, 73), P(m,)]. 

6. APPLICATIONS 

In this section I shall apply the theorems derived in Section 4 to two 

examples. 


Example J. 
(a) + 10n”’ +2n = 0, 


We shall compare the solutions of this equation with the solutions of the 
equation 
(b) + 10y” + 9y = 0. 


Here R(x) = = 9 S O, and (0) is satisfied by i () whenever 


(c) ＋ u 0. 


- Every interval of length z is an interval of oscillation for equations (6) and (e). 
Therefore, by Theorem 6, (9 - 7 S229) is a regular interval of both 
kinds for (a). Theorem 7 implies that if two solutions, n(x) and y(2), of 


(a) and (b) respectively, are such that 


(a) a), (* =0,1,2,3), 


then between q and the least [greatest] zero of (%) which is greater than a 
less than a] there exists at least one zero of / () at which y(7) changes 
signs. By Theorem 8 we know that for a =9, (a) possesses a backward 
interval of oscillation at x = a of length not greater than 7. 

Example 2. 
(a) — = O, 220. 


Here R(x) - e. In this case our comparison equation 
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(b) =0 


has no intervals of oscillation. If the particular solution 7; (2) satisfies the 
boundary conditions 


(e) = 1, a 


then / (r), the corresponding solution of (b), is identically equal to 1. 
Therefore, by Theorem 7, y () cannot vanish for any value of x greater 
than a. Similarly none of the other principal solutions of (a) at x =a 
vanish for x greater than a. 

If y (a) satisfies the boundary conditions 


(c’) (1) =0, (1) 1, nm (1) =0, (1) =6, 
then 

(T) = — 1) (- 2). 
Therefore by Theorem 7, 9 () has at least one zero in the interval 
(0 =2=1), no zero in the interval (1 = S 2), and may have zeros for x 
greater than 2. 


A GENERALIZATION OF THE FOURIER COSINE SERIES* 


J. L. WALSH 


It is well known that on the interval 0 = x = large classes of functions 
arbitrary except for certain restrictions as to oscillation or as to continuity 
and the existence and continuity of derivatives or as to some similar property 
can be developed into series in terms of the functions {cos nx}. The series 
are of the type 

ao 
(1) r) => + cos x + cos 21 
where any coefficient a, may be obtained by the formal process of multiplying 
equation (1) through by cos Mrd and integrating term by term: 


a, = — f(x) cos nade. 


1990 


The present paper considers the problem of developing arbitrary functions 
throughout the same interval in terms of the second set of functions {cos x , 
where X, is very near to n. Under suitable restrictions it is proved that the 
two developments have essentially the same convergence properties f We 


* Presented to the Society, September, 1920. 

+ This study was undertaken at the suggestion of Professor Birkhoff, who proposed an 
analogous generalization of the Fourier sine series. There is no analogous generalization of 
the sine series such that developments in terms of the two sets of functions {sin nz} and 
Sin A, x}, where X, is near but not equal to n, will have the same convergence properties on 
the interval O=2=~7. For at the right-hand end of the interval any particular function 
sin u is not zero and hence cannot be developed in a uniformly convergent series in terms of 
{sin nx}, but it can be developed in a uniformly convergent series in terms of {sin r. 
Thus the methods of the present paper cannot be used to give a generalization of the sine series 
op the interval 0 =z =z, but these methods with slight modifications would enable us to 
replace the set {sin nz} by a set {sin A, 2} such that developments in terms of the two sets 
of functions will have the same convergence properties on the interval 0 = z = 7 e, where 
e is arbitrary. 

The methods used in this paper are in spirit closely related to the methods used by Birk- 
hoff, Paris Comptes Rendus, vol. 164 (1917), pp. 942-945, who gives a similar 
generalization of Taylor’s series. Birkhoff makes use of the theory of integral equations, but 
the related theory of infinitely many variables as used in the present paper leads to more 
general results. 
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shall say that two series 


have essentially the same convergence properties when and only when the 
series 
xz 
(a; -- b; u;) 
il 
converges absolutely and uniformly to the sum zero. 
The set of functions 
2 2 
—. 2, 27, 
1 17 
is a normal and orthogonal set on the interval 0 =2=7. Any set of func- 
tions , continuous on a finite interval is said to be normal and orthogonal 
on that interval if and only if 


* 
ii; ij dx = 6;; 
e 


here and throughout this paper 6;; is the Kronecker symbol which is unity 
or zero according as i and j are or are not equal. In this integral and below 
we omit the argument, which is æ in every case, and also omit the limits of 
integration, which are the ends of the interval considered. We shall prove 
that corresponding to the set of functions 


1 5 2 
cos Ap r, cos Riæ, cos Rz æ, 


there is another set of functions such that the two sets are biorthogonal for 
0=2=7. Tuo sets of continuous functions ] and {v,} are biorthogonal 


1 = dn 


We shall prove a general theorem concerning the generalization of expansions 
in terms of any uniformly bounded normal orthogonal set (d, by considering 


on an interval if and only if 


a set %] where u, is a function neighboring to i,. Briefly described, the 
method is to expand the ½% into series in terms of the du, to invert the corre- 


sponding system of equations and expand the @, in terms of the u,, and 
finally to substitute the latter series into the expansion of the arbitrary func- 
tion in terms of the @,. This gives the expansion of f () in terms of the . 
The form of presentation of the material is slightly changed by the intro- 
duction of a set of functions ( such that n] and {v,} are biorthogonal sets. 
The various steps in the proof will seem much less artificial if the general 
method is kept in mind. 


2400000 231 
> a; u,, U; 
i=1 i=l 
— 
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After the general theorem is proved, application is made to the cosine 
series. We add that the method seems to be of very wide applicability; it is 
hoped later to give further applications to the Sturm-Liouville and other series 
developments. Throughout the present paper there is more regard for sim- 
plicity than for ultimate generality. 


I. A GENERAL THEOREM 


TueoreM I. Suppose that (d,] is a set of uniformly bounded normal ortho- 
gonal functions in an interval, and that in this interval {u,} is a set of uniformly 
bounded continuous functions each of which can be developed into a series 


(2) Un = + (n 1, 2, ), 
k=1 

where the coefficients have the values 


(3) Cnk + = fm d dx. 


Suppose further that the three series 


(4) Eels, (Sen), (Ler) 


i, t=1 k=1 


converge and that the value of the first is less than unity. 
Then there exists a set of functions {v;} such that {u;} and {v;} are biorthogonal 
sets: 


(5) und = (i, 1, 2, 


Furthermore, if f () is any function integrable and with an integrable square (in 
the sense of Lebesgue), then the two series 


(6) 164) Sar 


(7) f(z) ~ 


where 


a; = ax, 
(8) 
b; = ft: dz, 


have essentially the same convergence properties. 
The sign ~ is used simply to indicate that the coefficients a; and h are given 
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by (8). We shall give later some of the more immediate consequences of the 
fact that (6) and (7) have essentially the same convergence properties. 

To prepare for the inversion of system (2), which is an important step in 
the proof of Theorem I, we shall use the following 

Lemna: If for the system 


(I Ten) + C13 %+ 
Cor + (1 + C22) 42 + Co3 
C31 C32 + (1 + €33) a3 + - 


(9) 


the series 


2 


7 


E 


2 
Cis 


j=l 


converge, and if 2 <1, then the system has one solution {x;} and only one 
solution for which 


converges.* 
The lemma is proved by the method of successive approximations, setting 


= Cis 


— + ena +---], 
* — [ea — C12 x” 


From the well-known inequality 


n n 4 n 4 
* 1 k=1 k=1 
(which holds for all sets of numbers far] and {8;,}) and from the convergence 
of the series whose sums are denoted by and 22, it follows that the series 


are all absolutely convergent and that the system {2;} as thus defined is a 


This lemma is included implicitly in some results of Hilb, Sitzungsberichte 
der Physikaliseh- medizinischen Sozietät in Erlangen, vol. 40 
(1908), pp. 1-6. Hilb makes use of the theory of bilinear forms in infinitely many variables, 
so that his proof differs in form (but not in substance) from the proof here presented. We 
indicate the proof because of its simplicity and because we shall use later the inequalities 
obtained in the course of the proof. 


Cl, 
= 
— 2 
L ei, 
|| 1, j=1 
2 
i=l 
1) 
(2) 
(n+1) 
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solution of (9). 


If we introduce the notation r. = Lee 


ij? 


[April 


it is readily 


proved by further use of the same inequality that 


= 1 p’ 

(11) 
Sn 


The series Lr 27 converges, for we obtain the relations“ 


Tei SL . — + le; (2. ＋ 2 Dei. 


il i=l i=l i=l 


For if 


there were, we should have a solution {y;} of the homogeneous system cor- 


There is no other solution of system (9) such that 22. x; converges. 
responding to (9) and such that the series 


L yi 


i=l 


ll 


converges. In the /th equation of the homogeneous system, if we transpose ½ 


to the right-hand member, we obtain 


and hence we also have 


In this same manner we have 
x 
k=1 


where n is any positive integer, and hence y, = 0 (/ = 1, 2, ). 
We apply this lemma (the proof of which is now complete) to determine the 
set of functions fee of Theorem I. Let us assume a development of the type 


(12) 2 (di + dt, where dy; + = for ii; dx. 
i=l e 


The condition that the 7, have the desired property (5) is given formally 
from the developments (2) and (12) by the conditions 

(13) Chi di + dia + Cre dig + d + +++ = 0 

for every i and k. When i is kept fixed, this is precisely system (9); the 
* By use of the inequality 


(a +B)? S2 ( 


— — 

— 
Yi. pi, 
* 
k=1 
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conditions of the lemma are fulfilled by virtue of the hypothesis of Theorem I, 
and therefore there exists a solution (d, da, dia, :),“ with the properties 


stated in the lemma. If we choose that solution and use inequalities (11) and 
the convergence of series (4), we actually arrive at a system of continuous 
functions (r] which satisfy (5) and are given by (12) and (13). 

Our preparation for the inversion of system (2) is nearly complete. We 
introduce the notation 


(14) = Chi ＋ dix + ei di + dr + ds + 
and shall prove EHF = 0 for every i and . In the set of equations 

= + du + en di + ert dzi + est dai + 

= + da + ei dia + €2; do. + cs. + 

= ¢3; + d + ei dis + C2: dzs + ¢3:ds3 + 


multiply the ith equation by % and sum by columns, making use of (13) 
and (14). We obtain the result 


ck €i1 + Cro ＋ + = 


The formal work is readily justified by the convergence of series (4). More- 


over, the series Lier ef converges, from the definition (14) and from (11) 
applied to the set (du, da, diz, .) as a solution of (13), so the set (€:1, ei, 
es, ) forms a solution of the homogeneous system corresponding to (9) 
and hence eg = 0 for every i and . 

We are now in a position to invert system (2). We write the equations 


Cir + die + diz + C21 dox + €31 dE + 
Cree + dx + Cy diy. + C22 do), + C32 da R 
cs + dax + cis dix + C23 dox + €33 dar K 


multiply the ith equation by @; and sum by columns. Justification of the 
formal work is immediate, and we obtain the desired developments 


2 


7 — Ax = L dix 


i=l 
Let us suppose, now, f (a) to be any function integrable and with an inte- 
grable square in the sense of Lebesgue. We find the coefficients (S) by multi- 
For a non-vanishing finite determinant, the numbers di + ö are the quotients of the 
cofactors of the ci, + 6, by the value of the determinant. 


935 
| 

= O, 

=: (). 

=: (), 

(k 1, 2, 3. .). 
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plying (12) through by f () and integrating term by term. This yields the 
result 
(15) 55 — dx = dyy ay, + dis a2 + dig a3 


The difference of the series (6) and (7) may be written 


(a; 4%; b. = Sa; (a; u.) + >> (a; — b.) u. 
ia] 


1=i 


Deu * (da ay, + da dz + — 0 + dei; is), 
j=1 


i=l i=l 


(16) 


which series converges absolutely and uniformly and has the sum zero. We 
make use of the convergence of the series LH dr, which may be proved easily 


from the relation 


(17) (f — andi — d — . U,) dæ S 0. 


Hence, series (6) and (7) have the same convergence properties, in the sense 
that (16) converges absolutely and uniformly to zero. It follows immediately 
that properties of absolute convergence, convergence, summability, and 
divergence obtain for one series as for the other, and likewise the properties of 
uniform convergence in the entire interval considered or in any sub-interval, 
uniform summability, and also of term-by-term integrability. Whenever 
the two series (6) and (7) are convergent, summable, or properly divergent, 
their sums are the same. The nature of the approximating furictions and of 
their approach to the limit (in case of convergence) at a point of continuity 
or of discontinuity of f(a) is essentially the same for (6) as for (7). In par- 
ticular if Gibbs’s phenomenon occurs for (6) it also occurs for (7). The reader, 
will notice various other properties common to the sets (d] and (u,], such as 
the existence or non-existence of a continuous function for which the formal 
series does not converge at every point. 

The conditions of Theorem I concerned with the series (4) can be made 


2 2 ° 2 2 2 
* If we use the convergence of HA a, which is proved from (17); if we write the equations 
51 — dy; a, + die a2 + a3 —— eee 
be — dz = dz; di + doz dz + das da + 


53 — a3 = d3, di + d32 02 + da + +++ 


multiply the ith equation by ci, and sum by columns, we obtain 
ay bi = bi + bz + ba 4 


Hence if the {bi are all zero, So are all the fa; . The converse is evident from 15). 
’ 
We also add that if the series _ a;| converges, so does the series |b; | and conversel 
i=! 121 
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more restrictive so that the convergence properties of (6) and (7) will be essen- 
tially the same for all functions absolutely integrable in the sense of Lebesgue, 
or on the other hand can be made less restrictive and still yield a result say 
for functions such that the series of absolute values of the coefficients Lira, 
converges. 

We add the remark that the relation between the (ur, and the {v;} is essen- 
tially reciprocal—we may obtain corresponding results for expansions in 
terms of the {v;}. Theorem I is readily extended, moreover, by taking as 
point of departure developments in terms of functions füt] which are not 
supposed orthogonal. 


II. APPLICATION TO THE FOURIER COSINE SERIES 


For the interval 0 = a2 Ser, we have mentioned the normal orthogonal 
system of uniformly bounded continuous functions 


1 2 2 2 
(18) —, Tess, — cos cos 
7 Tv T 


Application of Theorem I will yield the following theorem. 
TueoreM II. The expansion of any function f(x) integrable and with an 
integrable square in terms of the functions 
9 


cos Ro æ, cos Ri 2, 
＋ 


COS , cos æ, 


on the interval 0 = x S , where 


2 2 9 ‘ 9 1 
(19) — 1)? + — 2)? + — 3)? + <= 


and where 


(20) — | 

converges, has essentially the same convergence properties as the expansion of f (x) 
in terms of the functions (18). 

It is of course true that our set of functions % can be developed in terms 
of the functions (d, so it merely remains to prove the convergence of the 
series (4) and that the value of the first of these series is less than unity. 

Inequality (17) gives us the relation 


00 
(u. (m 1, 2, 3. ), 


k=1 


and in the present case if m + 1 and if we replace m by n + 1, this integral 


a 
— 
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becomes 


= | (cos A, x — cos na)? dx 
e/0 


* 
= =f [4 sin? 3 (A, sin? 3 (A, - n)ajdzx. 


But we always have the relation sin & = fa, and hence we obtain 


* 


Lei S (M (n = 1, 2, 3 
k=1 
If we omit the factor 4 from the right-hand member we obtain the proper 
formula for n = 0. Satisfaction of (19) therefore means that we have satis- 
fied the following requirement of Theorem I 


>» . 


m, k=1 


It remains to be shown that the last two series in (4) converge. From the 


formula 


4 * 
(22) Cat+1, k+l = (cos A, & — cos nx) cos Ard 
7 
(n 2 


1, 2, k 


integration by parts twice yields the formula 


172 cos kx |" 
= (An sin A, — nN sin nr) le 


0 


(or 


— | (XZ cos M r — Nn? cos nr) 


cos kx 
dx. 
72 
To deal with this integral, we write 
24) cos — cos nx = (AZ — nn?) cos XM, + (cos — cos nr). 
If n > 2, we find with the aid of (19) the obvious relation 


An <2n+2 <n’, 
whence 

— n?| <n? |X, — nl; 
we use this inequality in considering the first term of the right-hand member 
of 24). The last term of the right-hand member of (24) can be transformed 
as in (21), and its absolute value is therefore not greater than n r M — Mn 

The first term in the right-hand member of (23) with the factor cos ka/k* 

omitted, reduces (sin A, 7 — sinnw). Whenn > 1,wehave |A,| Sn’, 
and therefore 


An (sin M, — sin nz)! Sn NM 


238 
11. 

2 
— 

1,2, ) 
(23) 
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From these inequalities for the absolute value of the right-hand member of 


(23) and from the corresponding results for = 0 and n = 2 we prove the 
convergence of 
nk | 
n, k=1 


by means of the convergence of (20) and hence* prove the convergence of the 
last two series of (4) and complete the proof of Theorem II. 

We add the remark that a closer evaluation of c, and also the consideration 
of functions uw, = un cos M x where u, is properly determined, will give 
broader restrictions than (19) and (20). 

HarvarD UNIVERSITY, 
May, 1920 


* Here we make use of the inequality 


n n 273 n 4 
> 2 
> | | = > = > * 
i i= 
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POLYNOMIALS AND THEIR RESIDUE SYSTEMS* 


AUBREY J. KEMPNER 


INTRODUCTION 


The present paper develops an elementary theory of polynomials (with 
rational integral coefficients) with respect to a modulus m, where m is a given 
composite rational integer, and of residue systems of such polynomials with 
respect to a modulus m. Only isolated results have been previously estab- 
lished. 

It is known that, for p a prime, and any set of integers a; (i = 0,1, +--+, 
p — 1), there exist polynomials with integral coefficients of which the set 
form a complete residue system* modulo p such that f (i) = a;. The simplest 
considerations show that this is not true for a composite modulus, m. In this 
case certain relations between the a; (¢ = 0,1, +--+, m — 1) must be satis- 
fied if they are to form a complete residue system modulo m of a polynomial 
with integral coefficients. The problem of establishing necessary and suf- 
ficient conditions for the a; to be, for a given modulus m, such a residue 
system, and the examination of such systems, was the starting point for the 


present article. The existence of a certain type of isomorphism between the 


structure of the totality of reduced polynomials modulo m on one hand and 
the totality of complete residue systems modulo m on the other hand sug- 
gested an independent parallel development of the theory of the polynomials 
and the theory of the residue systems. This has been carried out in Part I 
(Residual congruences and completely reduced polynomials) and Part III 
(Residual congruences and residue systems). 

After the article was written, | became acquainted with several papers on 
Kronecker modular systems which establish in a satisfactory manner relations 
between Part I and the well developed theory of modular systems. The 
short Part II, which was then inserted, is devoted to a brief discussion of this 
connection. 

The classical methods of the theory of numbers may be said to have a 

* Presented to the Society, Chicago, December, 1920. 

*See, for example, Zsigmondy, Monatshefte fiir Mathematik und Phy- 
sik, vol. 8 (1897), p. 20. 
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tendency to consider an investigation closed when the problem has been 
reduced to the treatment of a number of cases where the modulus is a prime 
or a power of a prime, leaving the synthesis of the general case from these 


special cases in descriptive form. The results and methods of the paper may 


make it possible to break down, in some fields of investigation, these barriers 
between the case of a prime modulus or a prime-power modulus, and the case 
of a general composite modulus. 

It is certain that this cannot be accomplished unless the methods are general, 
lead without trials to clean-cut results, and are readily applicable to a given 
numerical case. The last demands will justify the inclusion of numerical 
examples throughout the work. They also made it desirable to treat in detail 
some of the simpler types (m a prime; m a product of distinct primes; m a 
power of a prime, with the sub-cases m = p’, y < pjm = p’,y =p). This 
involves only a small increase in space, since the proof of the formul for the 
general case is naturally based on the simpler cases. 

The methods employed tend to emphasize, for a given modulus, the totality 
of completely reduced polynomials modulo m rather than the individual poly- 
nomial, and the totality of complete residue systems modulo m rather than 
the individual residue system.* 

Part III, however, also furnishes tools for the examination of the individual 
residue system. Some applications of the theory are reserved for another 
occasion. 

The introduction of some new terms and symbols could not well be avoided. 

A brief synopsis may be of assistance to the reader: 

In Part I, for a given modulus, m, a certain finite set of polynomials with 
integral coefficients is constructed such that the residue system modulo m of any 
polynomial with integral coefficients coincides with the residue system of exactly 
one polynomial of the set (F 2-5). We shall call the polynomials of this 


set “completely reduced polynomials”? modulo m ($5). Necessary and 
sufficient conditions, in terms of the coefficients, that a polynomial be com- 
pletely reduced, and the number N (m) of such polynomials, are established 
($$ 5, 6) by means of the “ chain of residual congruences” modulo m (§§ 
3, 4), the signature S(m) (§4), and the “ characteristic“ C(m) (§ 5). 
“The signature and the characteristic are symbols which depend only on the 
modulus m and which are readily found for a given m. The (for our purposes) 
essential properties of the chain of congruences are immediately determined 
from S(m) and C(m). 


*One possible extension of this work, along group theoretic lines, would have points of 
contact with a long paper by Zsigmondy, Monatshefte für Mathematik und 
Physik, vol. 7 (1896), pp. 185-289, Abelsche Gruppen und ihre Anwendung auf die Zahlen- 
theorie; with Weber, Lehrbuch der Algebra, II, 2d edition (1899), pp. 60-68 (Gruppe der Zahl- 
klassen), and others. 
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The main link between Part I and Part III is established by the italicized 
passage above, which indicates the existence of a one-to-one correspondence 
between the completely reduced polynomials modulo m and the complete 
residue systems modulo m. In Part III, arithmetical sequences are made 
use of to establish for any given modulus a certain chain of congruences 
between the elements of a residue system (residual congruences of the second 
kind) ($$ 10, 11), by means of which necessary and sufficient conditions are 
derived that a given set of m integers may be the complete residue system, 
modulo m, of some polynomial with integral coefficients ($12). From the 


“chain of the second kind” are derived a “ signature of m of the second 
kind,“ S(m) ($11) and a “ characteristic of m of the second kind,” C(m) 
($12). It is shown that the new signature and characteristic may be denoted 
by the same symbols as the signature and characteristic introduced in Part I. 
The new signature and characteristic are therefore again derived directly 
from the modulus m, and completely determine the nature of the totality of 
complete residue systems modulo m (§ 13) in a manner entirely parallel to the 
determination of the totality of completely reduced polynomials in §§ 5, 6. 
The number of complete residue systems for a given m is again N (m) (§ 13). 

There exists a complete isomorphism between Part I and Part III, which 
permits us to translate a statement concerning either the system of completely 
reduced polynomials modulo m or the system of complete residue systems 
modulo m into a statement concerning the other system (5 14). A résumé 
($ 14) lists the main features of this isomorphism, as far as they are used in 
the present paper. 

In checking the literature on the subject, Dickson’s History*—particularly 
vol. 1, ch. 8 (Higher Congruences), and ch. 11 (Greatest Common Divisor)— 
has been of greatest value. I wish to express a sincere feeling of obligation to 
Professor Dickson for the assistance which his splendid book has rendered me 


in this respect. 


I. RESIDUAL CONGRUENCES AND COMPLETELY REDUCED POLYNOMIALS 


$1. The number 


DEFINITION 1: We denotet by um), or, when no ambiguity is possible, 
by u, the smallest positive integer such that u ) ts divisible by m. 
We shall need only the following properties of u (m): 


* History of the Theory of Numbers, published by the Carnegie Institution of Washington, 
vol. 1 (1919), vol. 2 (1920). 

Compare Neuberg, M at he'sis, vol. 27 (1887), p. 68; Nielsen, Nieuw Archief 
voor Wiskunde, (2), vol. 10 (1913), pp. 100-106; Kempner, American Mathe- 
matical Monthly, vol. 25 (1918), p. 204. 


POLYNOMIALS AND RESIDUE SYSTEMS 


= 1; u (1) is arbitrarily defined to have the value 0, 
p, pa prime; then um) = p, 

pi D, Pr < Po < +++ < p, primes; then u(m) = 
= p’, y <p; then um) =p-y, 


223 3 
Il 


m= p, + = p; inthis case is no longer the highest power of p contained 
in (5. )! as a factor. Instead, u (n) is now determined in the following 
manner. For any integer / — 1 let p be the highest power of p dividing 
(k —1)!. The change in the exponent p caused by passing from ( — 1)! 
to k! will be as follows: p goes over into p + 7 when k = 0 mod v, but k = 0 
mod p**!; that is, for & not divisible by p, the exponent p does not change; 
for k divisible by v, but not by , p increases by unity, ete., Therefore 
for any prime p and any positive integer Y, the highest power of p dividing /:! 
can be found by the following simple scheme :* 

Write in a horizontal line, as far as required, the multiples of p: I , 
2. , 3 , ; for every positive integer t, write t under each one of these 
numbers containing as a factor “, but not containing vt. For any number 
y in our first row, the exponent of the highest power of p contained as a 
factor in ( y)! is obtained by adding all numbers written under 1 - p, i 
2+-p,-+++,k-+p. This is indicated by the following schedule: 


1 1 1 2 1 1 2 | 1 3 1 

1 2 +++ p—l ptlpt2 . 2p 2p+22p+s3 3). P+p—2 p+ :::. 
Example: p = 3 
1-3 2:3 3:3 4-3 5-3 7-3 8-3 9-3 10-3 11-3 12-3 13-3 14-3 
1 1 1 1 1 1 3 1 1 2 1 1 2 
1 1 5 6 8 9 10 13 14 15 17 18 19 21 


252 -p 2p 


6˙3 


2 
— — 


2 


so that, for example, A (3!) = 24, since 8 - 3 is the smallest integer such that 24! = 0 mod 3”. 
Also, (31) = 3½˙ = = 27, etc. (For y < p, only numbers 1 would occur in the 
second row.) 


m = px? +++ px’, (many positive integer); in this case u(m) is the 
largest (or one of the largest, in case several of the largest are equal) of the 


numbers u (I), (i:) (pr), that is, 
„1 ) 


From the last case, we derive immediately: If mi, me are relatively prime, 
u m2.) = Max. (u (mi), u (mn)). 


u(m) = Max. (u (I.), 


In Kempner, loc. cit., a formula for ꝝ (m) is given. 
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$2. Residual congruences modulo m 


DEFINITION 2: We call two polynomials @(2), Vr), with integral coef- 
ficients, residually congruent modulo m, and write 


when @(2) =() (mod m) for all integers x. 

We shall refer to such congruences as residual congruences.* In particular, 
o(2) =0 (mod m) is equivalent to the statement: @(2x) ts a polynomial with 
integral coefficients which is divisible by m for all integral values of x. 

For convenience, the following facts, most of which are well known, are 
stated as lemmas. 

Lemma 1: (a) For any positive integer d, 


u(d)—1 


I] (2 — i) =0 (mod d); 


1 


(b) For any positive integer d and any multiple m of d 


(x —7)=0 (mod m). 
i=0 

An identical congruence ¢(x) = r) (mod m), which implies that corresponding coef- 
ficients in @(2) and ¥ (xz) are congruent modulo m, is always at the same time a residual con- 
gruence modulo m; but a residual congruence, which only implies that for any integral value 
of x the residues modulo m of & (r) and of / (r) have the same value, is not necessarily an 
identical congruence. The types of residual congruences best known are II, > (—k)=0 
(mod m!), and the Fermat congruence x? = (mod p), pa prime. A partial examination 
of residual congruences has been made by Borel and Drach, Introduction a V'étude de la théorie 
des nombres, etc., Paris, 1895, pp. 339-342, who use as a starting point Fermat’s theorem, and 
by Nielsen, loc. cit. (SI), whose results are closely related to those of the present § 2, and which 
are derived by a similar method. Nielsen's object was the determination of the general type 
of perfect polynomials,” that is, of polynomials f(x) with rational coefficients, not all of 
which are integers, and such that f (r) is an integer for all integral values of 4. Nielsen’s 
paper has no connection with work following §§ 1, 2 of the present paper. The arrangement 
of his work and his notation are not well suited to our purposes, so that a simple reference to 
the article would not be a satisfactory substitute for this § 2. 

A very simple and elegant treatment of the related problem of determining the greatest 
common divisor d of all integers which can be represented by a given polynomial f(x) with 
integral coefficients and for integral values of x, is due to Hensel, Journal fir Mathe- 
matik, vol. 116 (1896), pp. 350-355. Hensel proves that d is the greatest common divisor 
of the n + 1 numbers f(0),f(1), ---,f(), where n is the degree of the polynomial. 

We ajso mention that, using the methods of Nielsen or of the present paper, it would be 
more natural to express, for example, necessary and sufficient conditions that a polynomial 
with integral coefficients be residually congruent to zero for a given modulus, in terms of the 
coefficients a; of f(x) = Za; - (;), that is, in terms of the coefficients of Newton’s interpola- 
tion formula, rather than in terms of the coefficients c; of the polynomial written in the form 
f(x) Den-. For special purposes, McAtee, American Journal of Mathe- 
maties, vol. 41 (1919), pp. 225-242 (239), needed the necessary and sufficient conditions 
that c +c, 2 + ea + ci = O (mod 4), in terms of the er, and found them to be: c = 0, 
201 = 2c2 =e, ce = 0 (mod 4). The present paper does not deal with this problem. 
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Proof: (a) follows from the definition of u(d) (see § 1) and from the fact 
that (i) is an integer. (b) is an immediate consequence of (a). 

Lemma 2: Any polynomial f(x) = cy 
the form* >=" a, (4), and an, en are both different from zero if one of them is 
different from zero. 

Proof: The actual transformation from one form to the other yields an im- 
mediate proof. (See, for example, Hensel, loc. cit. at the end of the first foot- 
note of this section.) 

Lemma 3: (i = 0, I. „ n) are integers, then a;/i! (c=0,1, 

„n), and therefore a;, are integers. 

Proof: Compare coefficients of like powers of x in 


Lemna 4: If f(a) =0 (mod m), then a; SO (mod m) (i = 15 „n). 
Proof: Let successively x = 0,1, ---, m, and consider the resulting recur- 


is uniquely representable in 


rence formule for a;. 
By Lemma 4, if f(a) =0 (modem), and f(a) a polynomial with integral 
coefficients, f (2) must be of the form 


= MQ}, ° 
(1) mar (1) = mao + „II. (2 — iy], 


where a; (k = 0,1, +++, m) are integers (positive, negative, or zero). We 
consider the coefficients ma;/k!. It follows from the definition of u (n) 
that, considering u,v as unknown positive integers, the smallest value of v, for 
which m u = v!, is v = un); and to this value of v corresponds 


u=p(m)!/m. 


Therefore the smallest value of / for which on the right side of (1) the coeffi- 
cient ma;/k! is unity, isk = w(m). 

We have thus derived: 

Lema 5: For a given modulus m the residual congruence 


(m 


(2) 1- II (x — 7)=0 (mod m) 


is of lowest possible degree consistent with the condition that the coefficient of the 
highest power of x shall be unity; or, stating this result in a different form: 
For any divisor d of m 
1 


(3) 7 II (2 — 7) =0 (mod m) 


The fact that this is the expression for the sum of an arithmetical progression of order n is 
of importance in Part III and (implicitly) in Part I (compare § 10). 
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is a residual congruence modulo m of lowest possible degree consistent with the 
condition that the coefficient of the leading term shall have with m the greatest common 
divisor m/d. 

From the preceding we abstract the following 

TueoreM I: (a) In any residual congruence modulo m, of degree exactly 
u (d), where d is any divisor of m (including d = m and the trivial case d = 1), 
the coefficient of the leading term* is either m/d or a multiple of m/d. 

(b) In any residual congruence modulo m, of degree < u(d), the coefficient of 
the leading term* has with m a greatest common divisor larger than m/d, and is 
therefore a multiple of m/d. 

For our purposes, the degree and the coefficient of the highest power of x 
in a residual congruence will be shown to be of particular importance. With 
this in mind, we introduce the following 

DEFINITION 3: We shall usually indicate a residual congruence either by 


e-2"=wW(2) (mod m), 


where W (a) stands for the wordst “any polynomial in x of degree 
<n, with integral coefficients, and which is resid- 
ually congruent to c-a", modulo m,“ or, using a still more 
condensed notation, by {n, c}, or {n, ea, or {n, e] mod em, placing in evidence 
only the degree n, the coefficient e of the highest power of x, and, where advisable, 
the modulus m. 

In most cases, the properties just mentioned of W (2) [ = ()] are the 
only ones we shall make use of; consequently, we do not usually think of 
y (a) as being any specific polynomial, but as any polynomial satisfying these 
conditions. 

To illustrate by a simple example: m = 30; 1·π r (mod 30); but also — (x — i) 
= 0 (mod 30); 5 (& — 1) ( — 2) O (mod 30); 15 4 ( —1)=0 (mod 30). There- 
fore we may choose in 1 - (r) (mod 30), (r) =, or (r) (& i), 
or Y () ＋ Sr (r 1) ( 2) + (de? er ＋ (r - 1), where a, 
bh, d, e, f are any integers; etc. See also Part II. 


§ 3. Construction and discussion of the chain of residual congruences 
modulo m 


For n given modulus m we may derive a set of residual congruences by 
choosing all positive factors of m, including m (and unity, for reasons of con- 


venience) and deriving the corresponding congruence (2), or (3), of $2. We 


* “ Coefficient of the leading term” = “ coefficient of the highest power of the variable 
which is not = 0 (mod m).” 

+ Occasionally we shall indicate the fact that / (x) is of degree not higher than n — 1, by 
writing (). 


— 


1921] POLYNOMIALS AND RESIDUE SYSTEMS 247 


shall show that these congruences will fall into subsets such that all con- 
gruences of any particular subset are implied by a single congruence of the 
subset. These dependent congruences we shall reject, and the set of con- 
chain of residual congruences modulo m.“ 


‘ 


gruences retained we shall call a 
We accomplish this by the following process, the discussion of which forms the 
object of the present and the following paragraph. 

CONSTRUCTION OF CHAIN: 1. Arrange the numbers u(d), where d ranges 
over all positive factors of m (including m = dy and 1) in order of non-increas- 
ing magnitude of ud). Since the u (d) are not necessarily all distinct, they 
will break up into subsets (some or all of which may contain a single element), 
such that in any subset all u (d) are equal. 

2. Arrange in each subset the u (d) according to decreasing values of d, 
i. e., according to increasing values of m/d. 

3. From each subset, select the first element. Let A (d.) be the element 
thus selected from the (i + 1)th subset (where do = m), then the numbers 
u (do), u (di), „ u (d.), (d. = 1, u (d.) = 0), are seen to be arranged 


so that 
(a) u (d.) >p(d;), for 


0% di F<F, for i<j. 
7 


d; 
4. For each u (d.) (i = 0, 15, „ 7) form the residual congruence 


= 1 — 
d, () (mod m), i. e., 00, 


5. The. set of residual congruences modulo m (of which the first is 


(mod em), 
and the last is the trivial congruence m - 2° =0 (mod m)) 


m m 
d. V (2) mod m, i. e., (i 0, 15 7), 


is called the chain of residual congruences modulo m, or, 
the chain of congruences modulo m. 

This process will be referred to as“ Construction, § 3.“ From the Con- 
struction and from Theorem I we read off: 

Lemma 6: In the chain of congruences as defined above, m/d; is a proper factor 
of m/ (i = 0,1, 7 1), (and consequently di d proper factor of d.). 

For the further discussion of the chain we shall need a few facts concerning 
the types of relation which may exist between two residual congruences for 
the same modulus. 
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Let {u, g} be a residual congruence modulo m, and (g, m) = e the greatest 
common divisor of m, 9. Then the existence of ſu, g} implies the existence 
of {u, e, and conversely, as is seen by multiplying both sides of the first 
congruence by g’, where 9 g’ = (mod m), and both sides of the second 
congruence by g/c, respectively. For this reason: 

We always assume in a residual congruence that the coefficient of the highest 
power is a factor of the modulus: in ſu, e (mod m), ¢ is always a factor of m 
(including e- 1 and the trivial case e = m). 

To the end of this paragraph, all residual congruences have the same modu- 
lus, . In a system of two congruences (ur, eil, ſuz, ez] we have nine 
possible combinations of ur S u, ei Se: By interchanging indices, these 
nine cases are reduced to the following five: 


1. e = = . 2. = C2; ur 3. ei < C2; ur = u. 
4. ei < C2; ur < u. 5. ei < C23 un u, > C23 ui Cu. 


We introduce the following 

DEFINTrIoN 4: Let I = fur, eil, II = {ue, ez]; we call I equivalent to II 
(or II equivalent to I), and write I = II when both ui = u, ei = et and call 
I stronger than II (or II weaker than I), and write I > II (or II < I) when 
either ei = C2, ui < Mos Or ei < C2, ui = uz, or ei < ez, ui Cu. In the 
cases not accounted for (ei < ¢2, ui > u, and ei > ez, ui < un) we do not 
consider I and II as standing to each other in any of the relations* =, >, <. 

From Definition 4 follows immediately: 

Lemma 7: If I = II, I and II are each one implied by the other, and therefore 
if either one is true, the other is also true. 

If I> II, (II <1), II is implied by I, and therefore if I, a stronger con- 
gruence, is true, II, a weaker congruence, is a fortiori true. 

While any two congruences of the chain for a given m are not in any of the 
relations , >, <, to each other, we have 

TueoreM II: If we denote by II any residual congruence modulo m then 
either 

(a) there is a congruence of the chain, I, such that I = II; or 

(8) there is a congruence of the chain, I, such that I > II. 

In other words: Given any residual congruence modulo m, the chain contains 


a congrhence not weaker than it. 


* We develop the character of the relations between residual congruences only as far as they 
are needed for this paragraph. The use of the symbols , >, S in this connection is justified 
in view of the fact that (1) from J > IJ follows II <J; (2) from J II, II = III follows 
I = III; (3) from I > JI, II > III follows J III, and similarly for <; ete. The results 
of the remainder of this paragraph may perhaps also be expressed in the language of Kronecker 
modular systems. 
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Proof: Let II = , e] be the congruence to be examined, where e is a 
factor of m, say c-5 =m. If u (do), u (di), „ w(d,) are the various 
values, in descending order of magnitude, which A (d) assumes as d ranges 
over all divisors of m, then we know from Construction that our chain 
contains for each exponent (d) a congruence J = {u(d;), m/d;}. There- 
fore, for one of these factors, say for d;, u (d.) = u(6). By Construction, 
either d. = 6, or d. = k-6, k >1 an integer. Assume first 6 = d.: By 
Theorem I, § 2, J is the congruence of lowest possible degree with coefficient 
m/d;. Hence, if y = uw(d;), I = II by Definition 4; if y < u(d;), II is 
impossible, by Theorem I; if y > u(d;), I II, by Definition 4. Assume 
next d. d, k >1, with u (d.) =u(6), as above. By Theorem I, 
II’ = {u (6), e] is the congruence of lowest possible degree with coefficient c. 
By Definition 4, > JI’. If y =u(6), IJ = II“, and hence > if 
y <u(d), II is impossible; finally, if y >u(6), II’ > II, and, since 
I> II’ (above), II. Q.e.d. 

We have thus established, by combining the Construction with the last 
theorem: 

TuHeoreM II“: For every modulus m the Construction of the present section 
establishes a completely determined chain of residual congruences 


(x) (mod m), i. e., 6 ů0, (i 0, 1. 7), 


d; 


and every residual congruence is either equivalent to, or weaker than, a congruence 
of the chain. 
Compare Part II for the relation of the chain to Kronecker modular systems. 


§4. The signature S(m); continuation of the discussion of the chain 


At the present stage of our investigation, we are handicapped by the length 

of the process described under the Construction of § 3. Our next step consists 
in showing how the chain may be obtained directly, for a given modulus m, 
by simple arithmetical operations. This is of importance if the method is 
to be applicable to a given numerical] case. 
From the chain (m/d;) (mod em), or {u(d;), m/d;} 
(i = 0, 15 „ r), we derive a symbol which we call the “ signature of 
m, S(m), and which is obtained by placing together, for a given m, all the 
separate {u(d;), /d] (i = 0,1, ---,7) im decreasing order of magnitude 
of di, but writing for convenience 20% for each i. This leads to 

DEFINITION 5: For a given modulus m, we define the signature S(m) as 
follows: 

um) +++ 4 (d.) =0 
S(m) = m m m 
1 = m 
dy d,—1 1 


i 

1 

| 

1 

| 
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or (ignoring the trivial congruence), 


um) pw(di) +++ (1) 
S(m) = m 


where {u(d;), md] (i = 0,1, «++, 7) is the chain of congruences modulo m. 
It follows that the chain of congruences is completely determined by the 
signature and conversely, so that our problem reduces to the determination of 


the signature for a given m. We consider successively the cases: m = p, a 
prime; m = pi Po°** Des Pi p22 Des primes; m = 
prime; m = p’, y =p, pa prime: m = pi'+ py? +++ pr, pi, , p, distinct 


primes. 
1. m = p: the construction of §3 shows that the chain consists of the 
single congruence {p, 1}, besides the trivial {0, p}. Therefore 


» O 


Confusion with the binomial coefficient is not to be feared. 

2. m= pi pe, Pr < po < p,: from $1, it follows that the 
factors do, di, , d, of step 4 of the Construction, and their respective u 
values, are d; = py + (d.) = pei (0 = 0, 1, 7), so that 
the residual congruences are (m/d;) - 2“ ) (mod m), or 


(x), 

= 0 ) 
S (i = 

(pi Po (7 1) 


97. 5 —1 „ (P. p—1 


1 5—1 (P- 2) 


Example: m = 2.3.5. 11, u(m) = 
Chain of congruences: 11, Ry, 18, * (3, 11 - 5}, 12, 11.5. 3], 10, 11.5 


11 5 3 2 
11 u-5 1-5-3)" 
Wiitten out, the chain may be represented by either one of the systems: (all modulo 
2-3-5-11) 

1. 1. Hie (z -i) SO 

11.2 (z) 11 
11.5. SV (z) 


11.5. 3. 2 () 56.8. 


) 
0. 
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9 


3. m= p, <p. Now d. = p™' (i = 0, 15, „), and the chain of 
congruences is 
= (mod r), {p(y -i), p'} (6 =0,1,-++, x). 
Hence 
„ 
1 7 p 
1 P pr 


Example: m = 51, (51) = 20, 
20 15 5 


52 
Chain: {20,1}, {15,5}, {10, 5%}, (5, 5%}, 
1+ 2®=y(z2) 1 - (r — i) =0 (mod 5*) 
5 11 5. (2 i) SO (mod 54) 
52˙ . 410 () 5˙ . II. Zo (x i) =0 (mod 5*) 
5. 5 =y(z) 5. (2 1) =0 (mod 54). 

4. m= p’,p=y. Nov, as in 3, the factors of m are exactly the numbers 
76, 6 2 O, 15 „ y; but to different values of 8 may now correspond the 
same value of u (see $1). From §1, and from Construction, § 3, it is clear 
that the values of u for all divisors of m are exactly the multiples of p up to 
u (%), which is itself a multiple of . In our chain 


* m 
(m/d;) u (d.), 


the degrees“ 


u (d.) =u(p’) —t-p 
and the d; are easily determined by the following schedule (compare § 1) 


u (di) (d) u (di) u (do) 


Here, for 1 5 6; =f — 1 when 7 = 0 (mod pe) but 7 + 0 (mod ptt), 
that is, for i <7, 6; is the exponent of the highest power of p dividing 


* Where d. = 1, dr1 = u(p) =p. 


JJ. 
j=i j=r-l Jr 
j= j= j= 
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=i-p (so that 6, = &=--- 1, 6, = 2, ö = Spire 
= = = 1, 3 = 2, % = 1, „ = 3, = 1. In 
case 3, all 6 = 1; hence the greater complexity of case 4. For i = T, 6, is 
chosen so that LT ö, = y. In (4) therefore, for i < 7, Li, is the ex- 
ponent of the highest power of p dividing [A (d=) ]! so that, by Construc- 
tion, § 3, 

-,7-1), 
while 


m 


does not divide [u (di) II, but does divide [u(do)]! = [u(m)]!, without, 
however, being necessarily the highest power of p dividing [u(m)]!. (See § 1, 
Example, and the example below.) Our chain of congruences modulo p“, 
{u(d;), m/d;}, therefore is 

= p(x) mod 


pr * PY)—p = (2) mod 


7 = P(x) mod p’, 
py - a? = (2) mod v, 
or*, shorter, 


772 
, (i = 0, 1. 7), 


where 6; is determined from the schedule (4) above, and where Li ö I is 
defined to be zero fori = 0. Hence 


Case 3 is contained in 4 as a special case. 


Example: m = 3". 


11 — 


Therefore à (31) = 27 
ms 27 24 21 18 10 9 6 3 O 
S 31 32 33 35 36 37 3 300 gn)? 
leading to the chain of congruences 


= since 31 = 1. The trivial congruence, corresponding to i = 
is here, and frequently elsewhere, suppressed. 

+ Under 27 we write 1, not 3, because £4; must be 11. Of course 27! is the smallest fac- 
torial of which 3" is a factor, but also 3" and 3" are factors of 27!. 


s/ 7 8 
2 = p* = 
cee 2p p 0 
3 6 9 12 15 18 21 24 27 
iz a 1 2 1 1 17 
1 24 5 6 8 9 10 ill 
) 10 9 7 66 5 3 2 1 0 


1921] POLYNOMIALS AND RESIDUE SYSTEMS 


(2), 3. = (2), 30 (), modulo 3", 

127, 1], {24,3}, 121, 25], 18, 15, 35}, 12, 19, 37}, 16, 3}, (3, 
The influence of y = p is reflected in the fact that the exponent in the leading coefficients of 
the congruences does not range over all values 0, 1, 2, ---, 10, but only over 0, 1, 2, 3, 5, 
6,7,9, 10. 

We proceed to the general case, 

5. m= pH. pP. pl, pi, Po, „ p, distinct primes. We derive first 
a chain for m = m+ mz, i = , m2 = , and therefore (m, m2.) = 1 
In the Construction of § 3, we have to consider (d) for all d = pf! - 4, 
0 S pi S , 0 = po = ye, and then to collect in separate sets all d for which 
u (d)) has the same value, and to retain for each wu only the largest d (in case 
there is one largest; otherwise, one of the largest). From the d so selected we 
should construct the chain in the following manner: For a d = p%' - p§? 
assume u (p?!) > u:); then w(p? - pi?) = from $1, end. Next 
consider p§?t!; if w (pet) > w(pi?t!), we have pe - pet!) = w( pe), 
and the factor 54 - p? of m is then disregarded. Letting pe increase in this 
way from pz, o + 1, . to ½, we shall either still have u (t) > w( py), 
in which case all factors pf! - pet - pe are disregarded; or, 
u (pe!) = w(p%?), in which case there must be a largest exponent of po, (we 
designate it by po 1), for which still u > w( pet"), but 
u (pe) w(pe?**); then w( pe - = w(pet*). We consider in the 
ath. pet? ete.: assume that 


same way the factors - pf, pe 
u (r > w( pete), but u (pet) = w( pit’); then we disregard factors 
pertr ptt, «++, pet. and retain as next factor and 
so forth. 


In this manner we systematically discover which factors d of m must be 
retained, and it is clear that they can be arranged in increasing order according 
to the schedule . pet pet pet, «++, where p, = pr, p, = pz, and the 
sign of equality holds in at most one of the two relations. It is also clear 
that the factors to be retained are uniquely determined by this process. 


Example: m = 54+ 7%. (51. 73) 3. 7. In the following rectangular array for all 
divisors of m the factors which have to be retained for our chain are in heavy type: 
4 (71) =7, 4 (72) = 14, 41 (75) = 21, 

(51) =5, (51 . 71) 4 (51 72) = 14, (51 73) = 21, 
4 (52) = 10, (5 . 7!) = » (52 - 72) (5 73) 21, 
1 (55) = 15, 42 (53. 71) f p (5% . 72) 4 (53 . 73) 21, 
4 (5) 20, (54 71) =: (5472) = (51 750 21. 

We shall have the chain 

(r) mod 51 

3.5 () mod 54 

- 22-7 (r) mod 51 

- = y (r) mod - 

= (r) mod 5- 

41.5 = (r) mod 54 


oe we ST 


929 
| 
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or in condensed form: {21,1}, {20,7}, 15, 5. 7}, {14, 52. 7}, 10, 5 


7*}. The corresponding signature is 


1 8 
8657909 = 
1 7 5.7 


(5, 5% 


For a modulus m with a large number of factors this method would be very 
tedious. We show (first without proof) how we may obtain for the example 
above 8 (51. 7*) by inspection from 8 (5 and S(7*) and shall then prove 
that this method applies also to the (general) case m = m, + mz, (my, m2.) = 1. 


In 
2 5 5 2 7 0 
4 


88 | 5 7 


combine the two sets as follows, arranging the exponents in decreasing order of magnitude: 
21 20 15 14 10 7 
127 125 72 
In the second row every 7“ (except 7° and 75) is spaced between two powers of 5, say 5”, 56, 


y <6; thus 7' between 3 and 52, 7? between 5? and 5*. We multiply 7* by 5°, and replace 


7* by 7*- 58. Similarly, every 5“ (except 5*) is spaced between two powers of 7, 7‘, 7*, 
e St. This 5! is multiplied by 7*, so that 5! is replaced by 5'- 75. We obtain 


1=7° 5.7! 


and the first and third lines represent S (51. 73). 


There is one more point to consider. It may (and frequently will) happen 
that one of the congruences in the chain for Mi and one of the congruences in 
the chain for e will have the same degree. 


For example, in m = 2. 3 (2) = u (3?) =6. We have, for this case: 


ae 6 4 2 0) oe 9 63 0 
“(5 23 2˙ 20 2). 390 3 33 


and, combining (and writing the coefficients in two separate lines), 


hence§ 
7 34 9 
S = 
) ( 1 32 
leading to the chain 9, 1}, (8, 37}, (6, 22. 32 


We show that the method used in these two examples applies without change 
to the general case m = n- n’, n, n’ any factors of mn, which we assume to be 


7}, 
14 10 7 5 0 ) 4 
52 7 52 72 53 ° 7? 53 . 73 54 ° 73 N 
3.7 4.5 3-5 2-7 2-5 1-7 1-5 0 
12 7 125 7 5 7? 55 72,5% 
7! 5. 5% 7 5% 7 75, 
9 Ss 6 } 3 2 0 
3° 3? 33 3 
23 24 20 27 
3 . 2% 2. 3 3 . 2 2. 3 3% 2% 2 3% 3.27; 
3 2 0 
2 . 3 26.34 2 3% 
12 26. 34} 
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relatively prime, as we may do without causing loss of generality, since the 
case m = i has already been treated. 
For this, we prove 


Lemna S: Let 
u (Nn) pldy) 1 (d.) 


S(n) 
dy 


S(n’) 


dj 
so that the d;(d; > di4,) are certain factors of n, and the d, (dj > di41), cer- 


tain factors of n’; assume also that, after all exponents in both signatures have been 


arranged in decreasing order of magnitude, a part of the combination is as follows: 


a 


then the corresponding part of S(n un is: 


u (d.. ds) u dg) +++ dg) u (de- 


n 
} 1 ds 


(%. d,) (d, ds) 


n’ n’ n 


dy 4, 4, 4 

Proof: For any (d.) the signature 8 (n) yields a residual congruence 
u (d.), n/d;}n, and for any u (d,) we get from S(n’) a congruence {u(d;), 
n'/d;},. After combining the signatures in decreasing order of the u, as 
assumed in the lemma, we select any one of the u (d), for example (d.) 
This u (d.) will be followed“ by some (d“), say u (d,), and therefore 
u(d;) and, since d; and d, are relatively prime, = d,) 
(see § 1, end), where d d, is some divisor of n n“. Therefore (§ 2), there 
exists a congruence ſ½ (d, d,), und d, The possibility of a d) 
equaling a (d“) offers no difficulty. (See Example S (27. 34) above.) It 
is seen that all residual congruences modulo nn’ which the construction of § 3 
calls for, actually are obtained in this way. The lemma is thus proved. 

* Unless yu (d.) is smaller than any » (d“), which will always happen for the smallest »(d), 
except when the smallest A (d“) is smaller than any »(d). This does not give rise to any 


difficulty. See Example S (54 - 75), above. 


— 255 
n 
| — 4. 
u(n’) u (d.) 
n’ n’ n’ 
d,) ( desi - d.) 
n n n non 
43 d, d, d, 
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Since, by assumption, u (d.) >u(ds), u (dt) >w(ds), (d.) 
u (dz), we shall have (d.. dg) = u (d.), (di- dg) = u 
u (d., de) (d.,); similarly, (de d,) d,) =u (days), 
(dg, d,) =u(dg-); and u (d, = dy), u (d ds) =u (dr), 
... Then part of our signature 8 (n. n’) is given by 


(d.) (di) (d,) A (de) desi) (da-) (d,) 


nn’ nn’ nn’ nn’ nn’ nn’ un! nn’ 


— 


d. dg d, des da d, dai d, da- d, d, de 


But this represents exactly the law indicated in the special examples m = 54 . 73, m = 27 - 34 


above. To make this perfectly clear, we rewrite a few lines from these problems, with only 


obvious modifications: 


4 (54) 4 (53) p (52) 4 (51) 4 (1) 

— — 20 215 =10 =v 

n = 54, n’ = 73, S(5*) = ( 
5⁰ 5¹ 5² 53 54 


\ 


4 (73) 4 (72) 4 (71) 4 (1) 
— =14 227 =0 
S(P) =| 8 
7° 7¹ 73 


From these, by Lemma 8 and taking the last remarks into account, 


4 (73.54) 4 (54.72) 4 (53-72) 4 (72-52) 4 (52-71) 4 (71.51) 4 (51.70) 
2 ~ = 73) =p (54) = 53) =m (72) =p (52) = 71 = 51) 
S (54 73) = 40 — 40 4 0 40 
70 5 . 71 71. 5 7 672 


in agreement with the example. 
To show that, here again, the method requires no real modification when a u (d.) equals a 
u (d,), we indicate in the same manner the work for the other example, m = 2’ - 34. 


4 (27) 4 (214 4 (23) 4 (21) 4 (20) 
7 « 1797 =§ =ti =4 =2 = 
n=2), n’=3'. S(27) = ), 
20 23 24 96 97 


(34) 4 (32) (31) 4 (30) 
0 =9 =6 =3 =U ). 
32 32 32 3 


Then 
(3.27) 4 (2732) (2482) 4 (32.23) (23-31) (31.21) 4 (21.30 
=m (44) 27) (24) = (32) =p (32) =p (23) = (31) =p (21) 
S (2? - 3*) = 
30 . 20 20 32 32 23 32.24 33 33 . 26 26 34 


3 


33 25 


- 33 


in agreement with the example, since {6, 3? - 27.34 K {6, 32. 2827.4, and may therefore 
be omitted. 


The general case, m = pY'- p¥ --+ p,” , can always be treated by successive 
applications of the lemma last proved. However, it would be a very simple 
matter to display a schematic arrangement by means of which S (- h . 
is obtained in one step from 8 (t (i = 1, 2, „ ), if this were worth 
while. One example, below, is worked out in this way, and this will suffice to 


make clear the general process. 
72. (72) = 14, w(3*) =9, (27) = 8; w (2? . 31. 53) = 14. 


Example: m = 27+ 3 


a 8 6 4 2 0 3 6 7 0 
8 (2) ( ) S (34) = ( 9 ) (see above), 


29 23 2 4 26 27 


i. e., 
9 8 6 6 t a 2 
1 * 23-3 3-2 
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= 8 
1 7 32 
Chain: 14, 1 


We combine the results of S8 3, 4 in the following theorem. 
88 5 

THEOREM III: To any positive integer m (the modulus)—which we assume 
given in factored form—we can determine by a definite and very simple arith- 


metical process the signature 


u (n) u (dnn) wld) =0 
S(m) = m m m 


dy diay 


where each d; is a proper factor of all d;, J , and consequently a divisor of m, 
and do = m, d. = 1. 


S(m) completely determines the chain of residual congruences modulo im, 


m 


7. II (x — 7) =0 (mod m) 


(i 0, 1.5, „7 1) or 0, 1, 


= P(x) (mod n 


or, as we also write,* 


Every residual congruence modulo m is either equivalent to, or weaker than, a 
_ congruence of the chain. 
If m = ph - py +--+ py, then the number of congruences in the chain 
modulo m is = Y + ¥2 + --: + 7, (besides the trivial congruence). 


* When the modulus m is to be explicitly stated, 


m ) 


{ 
4(de), d; 


or 44 (di), =} (mod m). 
d; ) 


Trans. Am, Math. Soc. 17 
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es 14 7 0 
807209 = ‘ 
‘ ‘ 
14 9 8 7 6 U 3 2 0 
a 7¹ 
3² 
)3 94 2 7 
70 30 7} * 3; 7} 3? 23. 93 3? 725 2433 72 332 72; 2 311072 31 72 
S (27 - 3. 72) 
1 7 6 1 3 2 0 
7 22.327 22.32.72 21. 33.72 2 33.72 2.34. 72 Qi. 31. 7% 
, 19, 7}, 18, 7, 7. 7. , 2, 
— = — — eee mM 
do d —1 d, 
, 7) 
or, 
; 
m | 
Ju (d.), (s =O, I, „. 
d; | 
— ö 
2 
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$5. Completely reduced polynomials modulo m; the characteristic ( 


We make use of the chain of congruences of $$ 3, 4 to reduce any poly- 
nomial with integral coefficients to a normal form which is to be, modulo m, 
residually congruent to the given polynomial. First we may, by repeated 
application of the first congruence {u(m), 1}, reduce a polynomial of any 
degree to one of degree at most u(m) - 1; then repeated application of the 
second congruence, (di), m/d,}, permits us to reduce in the new poly- 
nomial the coefficients of , , to numbers of the set 
0,1, +--+, - 1. In the same way, {u(d2), /d] reduces, without 
affecting coefficients already reduced, the coefficients of 
to numbers of the set 0, I.. ., (% dz) — 1. Continuing in this manner, 
we find that finally the coefficients of “( are 
restricted to the numbers 0, 1, ---, (/e) — 1, while the coefficients of 
are not restricted at all beyond the obvious 
limitation that they shall belong to the set 0, 1, ---, m— 1. 

We are thus led to an arrangement of the following type: any polynomial 
is reducible by the chain of residual congruences modulo m to a normal form 
1* 


in which the degree is u(m) — and in which 


the terms with exponents: (1) =0, --- r-1)—1 u (di) +++ w(dr2) — 1 
have their coefficients re- 
stricted to the numbers Q -++ 1 


() — 1 
0 — 1 0 . (m/d;) — 1. 


We have then: counting the degree of the reduced polynomial as exactly 
u (m) 1, and assuming it written in the form ay + 


4% (m)—1 · 


the A lowest coefficientsf do ¢a,_,)-1 are each one of the m num- 


bers 0,1, +++, — 1; 

the u(di1) - u(d;) coefficients a,q@), , d, = are each one of the 
m/d; numbers 0,1, «+--+, (m/d;) — 1; 

the u (di) - coefficients a, (a), are each one of the 
numbers 0,1, --+, (dz) — 1; and, finally, 

the u(m) — u (di) highest coefficients 4, are each one of the 
m/d, numbers 0, 1. , — 1. 


* Admitting that one, or more, of the highest powers of « may have a coefficient O; or all 
coefficients may be zero, including the constant term. 

+ For conformity with what follows, we should write: the A (dri) — (d-) lowest coef- 
ficients,” but d. = 1, u(d;) = 0 ($1, beginning). 


28 
— 


1921] POLYNOMIALS AND RESIDUE SYSTEMS 259 


This is indicated by the following symbol which we call the characteristic 
of m, and denote by C(m). 

DEFINITION 6: We define the characteristic of m, C(m), by writing in a first 
line the number of coefficients in each subset and in a second line the number of 
values over which the coefficients in each subset may range, and arranging so that 
going from left to right corresponds to increasing degrees of the terms: 

| u(dr-2) — | +++ (di) - (d.) 


C(m) = m m 
m eee 
41 d. 


(di) (dz) w(m) - 4 (di) 
m m 
de d; 
We call a polynomial of degree = u(m) - 1, and of which the coefficients are 
restricted as indicated, a completely reduced polynomial mod- 
ulo m, or simply a completely reduced polynomial. 

Two completely reduced polynomials modulo m cannot be residually con- 
gruent modulo m without being identical; that is, if fi (r) SEC) mod m, 
and Fi (r), fe(a) are both completely reduced modulo m, then Fi () and 
f2(a) have corresponding coefficients equal to each other. For, if this were 
not so, we should have & (r) = Fi (r) - (r) =0 mod m; and in the poly- 
nomial & () at least one coefficient would be different from zero. Assume 


that in & () the highest term whose coefficient does not vanish is c 2’, 
u (d.) SY Since both in fi (2) and in fo (a) the coefficient of x” 
lies between 0 (incl.) and m/d; (excl.), we shall have 0 = c¢ < m/d;, thus 
contradicting Theorem 1, §2. In particular, a completely reduced poly- 


nomial is residually congruent to zero when and only when all coefficients and 


the constant term are zero. 

We have thus proved 

THEeorEM IV: Every polynomial with integral coefficients is modulo m re- 
sidually congruent to one and only one completely reduced polynomial. In 
greater detail: 

If the signature of m is 


= (4 u (di) u (d) 


1 di m/d, +++ m/d_y m/d,=m 
then the characteristic is 


C(m) = (dz) -u (di) (de) 


m m/d,—1 | my di 


and every polynomial with integral coefficients is modulo n residually congruent 
to exactly one polynomial do + aya + +++ + + % (% 
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＋ A, (d; (di-1)—1 + Gy (di 1) + + a, (d,)—1 · gt 
+ in which each of the coefficients 
Mo, has a value 0,1, I; each of the coefficients a, (a, , 
has a value 0,1, +++, (m/d;) — 1. 
For any m, the characteristic may be immediately read off from the signature. 
When m contains only distinct prime factors, or when m is of the form p’, 


y <p, the characteristic is expressible in terms of the prime factors and the 
exponent, without making explicit use of the A function: 


» 0 
(a) P3 = (7 700 


(b) m 2 po, Pi < po <p,; 
8 (pr Pp) 
= . 2 P1 
C (pi po Pp) 
PoPp-1° PI Po’ * * P2 | P3\" 
, 
— 1) ply—2) 
1 7 


P| 
But whatever the type of m, the characteristic, and thereby the set of com- 


pletely reduced polynomials modulo m, are in all cases easily determined. 
Examples are given in § 6. 


8 (D = 


We see that the chain of residual congruences modulo m, and likewise, the 
characteristic C (m) or the signature S (n), determine: 

1. The structure of the system of completely reduced polynomials modulo m; 

2. The structure of the individual completely reduced polynomial modulo m. 


§ 6. The number Un) of completely reduced polynomials modulo m. 
Classes of polynomials 
From the characteristic C (m) it is easily possible to determine the number 
Vn) of distinct completely reduced polynomials modulo m, since the char- 
acteristic gives the degree u (m) — 1 of the polynomial as well as the range of 
values for each coefficient. We obtain,“ since there are u(m) - u (di) coef- 


* Counting as a completely reduced polynomial also the polynomial all of whose coefficients 
are zero, as well as the constants 1,2, ---, m — 1. 


260 
0 
2 Po*Pp-1** 
Po-1 
0 
1 
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ficients which may independently assume any of the values 0, 1. , (d 
— 1 etc., 
THEOREM* V: 


4 (di)—« (dz) 4 (d —4 (di- 
N(m) = . m* (4r—1 
1 dy 


(5) = m*™ . d dy (d (di) 


m mn) d, 4 (di) d. 2 4 (d. 2) d —1 4 (d- 


Special cases are: 
(a) mp; N(p) = p?. 
(b) m *** Py, Pry , Py distinct primes: 
N (pi > Py) = py py 
(ei = N, ¥ < p; N(p*) = preter) = 


From the manner in which in § 4 the signature of mi- mz was derived from 
S (mn) and S (mz), it follows that, for mi, m2 relatively prime, we shall have 
N(m,+ m) = N(m,)-N(m:.). However, since we have explained a 
direct and simple method for determining N (m) for any given m, we shall 
not discuss the functional properties of VC). 

Our work suggests a division of all polynomials with integral coefficients 
into N(m) classes modulo m by grouping always into one class the 
infinitude of polynomials which are residually congruent modulo m to the same 
completely reduced polynomial. All polynomials belonging to the same class 
will be residually congruent to each other, modulo m, and we may select the 
completely reduced polynomial as representative of the entire class. The 
main property of the classes, for our present purposes, is expressed in the 
(now obvious) 


THEOREM VI: There are exactly N (m) classes of polynomials modulo m; every 


polynomial belongs to exactly one class; all polynomials belonging to the same 
class have the same complete residue system modulo m; no two polynomials 
belonging to distinct classes have the same complete residue system. 

From this follows a result, which, from its character, belongs to the third 
part of the present paper, and which may be briefly stated as follows: 

Of the m™ possible sets of m numbers which may be chosen from the elements 
0, 1, 2, % mn — 1 (with repetition), exactly N (m) are complete residue systems 
modulo m of some polynomial with integral coefficients. 


* Compare Part II, end. 
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7. 7 
Example“ 1: m=7. w(m) =7; 807) 


The completely reduced polynomials modulo 7 are the polynomials ao + a; 2 + +--+ + a¢2°, 
where each coefficient independently assumes all values 0, 15, „ 6. 
Example 2: m 2. 3.5 11. (mn) = 11; 


S(m) = 


Cen) 


N(m) = 11°- -5)?- 5. 31. (11.5. 3. 2)2 = 2. 3. 5 11", 


The completely reduced polynomials modulo 2.3. 5. 11 are the following: ao + dn 
+ + +++ + a2", where 0 do, ai <11-5-3-2; OSa,.<11-5-3; O=a;, 
a, < 11-5; 0 as, a, ar, ds, < 11. 

Example 3: m = 3". 4 (3ʃ¹) 


3 0 
3 zu 


N (3) = 
The completely reduced polynomials modulo 3" are ao + a, 2 + doz? + +++ 4+ a2", 
where 0 = ao, 1,2 <3"; 0 =43,4,5 < 3% 0 2. < 3°; 0 =a, 10, 11 < 37; 0 Saye, 13,14 < 3%; 
O = a5, 16.17 < 3°; 0 = ais, 19,20 C 33; 0 = aa, 22,23 < 32; O Saas, 25, 26 <3. 
Example 4: 2 . 3% 72. (27 


8 9 8 
Sn) = 
1 


C(m) = 


N(m)ft = 2®- 37 


The following property, which we illustrate only by considering the special 
case m = 2’ - 310. 72, also holds for all m, as is immediately seen. For com- 
plicated moduli, it may be used as a convenient check.—On the same linear 
scale measure off, starting always from the same point, 0, the lengths 2, 4, 
6,8 ue); 3,6,9 (30); 7,14 = (72). We obtain in this manner 
the following marks on our scale: 0 2 34678 9 14, that is, the first line of 
S (27. 3. 72), in reversed order; and the set of intervals between successive 
points, 2,1, 1,2, 1,1, 1, 5 gives the first line of the characteristic, in proper 
order, The analogy with the method of the ‘Sieve of Eratosthenes“ is 
obvious. 

* Compare examples of § 4. 

+ The fact that each exponent is divisible by the corresponding base, is the expression of a 
simple theorem which holds for all values of m. 

The completely reduced polynomials modulo 27 - 34+ 7? are d + + +++ +ajy32%, 


where 0 = ao, a; 2.31. 72. .; OS as < 7; 0 a, 10, 11. 12. 13 < 7. 


~ ~ 


= 2 5 3 2 0 )i 
1 11 11-5 11.5.3 11-5-3-2/)’ 
= ( 2 1 | 2 6 j 
— 11.5.3.2 11.5. 311.511) 
S(3") = * 24 21 18 15 12 9 6 | | 
3 FF FF 
C(3") = ( i398 (28 [3 ) 
Zu 310 3 37 35 37/3? 37’ 
7 6 1 3 2 0 
7 22. 32.7 23.32.72 21. 33. 72 26.33.72 26.34.72 27. 31. 72 
2 1 1 2 1 1 1 5N. 
31 . 72 2. 3. 72 Di. 33 72 21. 72 22 . 32 7222. 327 32.777 
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II. RESIDUAL CONGRUENCES AND KRONECKER MODULAR SYSTEMS 
§ 7. Residual congruences and Kronecker* modular systems 


From Part I, §§ 2, 3, 4, it follows that the existence of a residual congruence 
f(x) SG () (mod m) is equivalent to the existence of an ordinary identityf 
in æ of the following kind, in which co (r), ei (r), . denote polynomials 


in æ with integral coefficients (which may reduce to constants, including 0): 


f(x) G (2) + 0 (2) go (2) + ei (2) gi (x) 


(6) 


where 
m 
gx (2) = II (7-7) for k > 0; go(x) =m. 
k i=0 
This identity is, in Kronecker’s notation, equivalent to 


u(dg)—1 


f(x) = (x) moda | m; (x i); II (x -i); 


(7 2 (4 —10—1 
777 ° 
II 6-90. 


i=0 


and, if we write for shortness Im; - (2 -i) I = M, 


then M is a Kronecker modular system or a Kronecker modulus. 

The congruence has this peculiarity, that all functions g, (r) of the modulus 
are themselves residually congruent to zero modulo m. The fact that the 
functions of the modulus determine the complete chain of congruences modulo 
, as explained in Part I, corresponds to the combined statements :t 

(a) if f (2), G (r) are any two polynomials (with integral coefficients) such 
that mod m, then co (r), en (a), «++, () can be determined 
so that (6) is satisfied; 

(b) if any function 9 (2) is omitted from the modulus M, then there exist 
residual congruences f (2) =@(.2) (mod m) for which the identity (6) can- 
not be satisfied. 

In other words: every polynomial with integral coefficients, and whose 
value is divisible by m for all integral values of x, is representable in the 

* Instead of using, as in the text, one Kronecker modulus, I had originally employed a set 


of congruences with double moduli (, ¢(z)). L. E. Dickson, in conversation, suggested the 
use of a single Kronecker modulus. 
F The sign of equality, =, is used, to avoid confusion with the sign for“ congruent to.“ 
t On account of the properties (a), (b), the modular system M may be called a“ re- 
duced fundamental“ modular system. For a related “ fundamental system,“ introduced 
for a different purpose, and consequently not reduced, see Hensel, Ueber die Zahlenteiler ganz- 
zahliger Funktionen, Journal fiir Mathematik, vol. 116 (1896), pp. 350-356. 


— — — 263 
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form Diz) ce: ge (a), and if any of the g, (2) are omitted, there will be such 
polynomials ¢ () which can no longer be represented in the form (6). 

Many authors have continued along various lines the work on modular 
systems inaugurated by Kronecker. One problem, in particular, is the 
examination of the “ equivalence ”’ of two such modular systems, and the 
reduction of a modular system to a canonical form. This subject is treated, 
for example, in several papers by Hensel* and Landsberg.* The necessary 
and sufficient conditions that a modular system shall be in canonical form 
may, according to these authors, be stated as follows (see 5 of last footnote, 
p. 365): 

A modular system, M, is in canonical form when it is of the form 


M ge, 9a» „ Ors |, 

where 

J. gi (i = 1, «++, 7) are polynomials (in x) with integral coefficients and with 
the coefficient of the highest power of x each equal to unity; 

II. The degrees y; of gi (i = 1, ---, 7) are decreasing integers, yi > Vn: 

III. V. (i = 1, +++, 7) are integers and each k; is a proper factor of the next 
Fi (and therefore of all succeeding k); 

IV. Foro = I, 2, 7 Il, rl, the polynomial g, is divisible by the 


(and this new system is then, as a consequence of I-IV, again in canonical form). 
After these preliminary remarks, we may state the relation between Part I 
of the present work and the theory of Kronecker modular systems as follows: 


modular system 


THeoreM VII: For a given m the Kronecker modular system 


u(m)—1 42 1204201 
M = E II (2-3); 2. II (2-4); 


7 


m 420410) —1 


*1, 2. Hensel: Zuriickfiihrung der Divisorensysteme auf eine reducierte Form, I, Journal 
fiir Mathematik, vol. 118 (1897), pp. 234-251; II, vol. 119 (1898), pp. 114-130; 

3. Hensel: Ueber die elementaren arithmetischen Eigenschaften der reinen Modulsysteme, 
ibid., ol. 119 (1898), pp. 175-185; 

4. Landsberg, Ueber Modulsysteme zweiter Stufe und Zahlenringe, Nachrichten der 
Gesellschaft der Wissenschaften zu Göttingen, Mathematisch- 
Physikalische Klasse, (1897), pp. 277-303 (277-286). 

5. Encyclopédie des sciences mathématiques, Tome I, vol. 2, Landsberg-Hadamard- 
Nürschäk, Propriétés générales des corps et des variétés algébriques, pp. 342-366. 

See also Hancock, Canonical forms for the unique representation of Kronecker’s modular 
systems, Journal fir Mathematik, vol. 119 (1898), pp. 148-170. 


E 


1921] POLYNOMIALS AND RESIDUE SYSTEMS 
which is uniquely determined by the signature 


(m) = m m m 


d—1 


is in canonical form. 

Proof: Conditions I, II, III are obviously satisfied. To show that IV is 
also satisfied, it is only necessary to translate condition IV into language not 
involving the special notation of modular systems: 

IV. For ¢ = 1,2,-+--,r—1,r>1, the polynomial g, can be expressed 
(identically in æ) in the form 


I. 
= Jott k, — 2° Jo+2 +: ke V. Iv ’ 


where Wii, „ W-41, and (as we know), g,41, „ g, are polynomials in x 
with integral coefficients, and (as we know), e, are 
integers. 

Applying this condition to our modular system M, we obtain for IV: 

We have to show that it is possible to determine, for ¢ equal to any of the 
values 1, 2, ---, r 1, a set of polynomials in z, Ve, Vi, „ Wr, with 


integral coefficients, such that the following equation is satisfied identically in æ: 


d 


10 —1 


42 (41)—1 


i=0 i=0 


0, 


Example: m = 2 . 


(2 —i); (2 - i); 3.5 (2 — i); 2 
Then 


5 II. Se -i) 5. (2 i) += 
for Vi = (x —3) (4 —4), = O0, = 0; similarly 


—ͤ 
0 
8 
4 (d 
4 4r— —2 
d,—g—2 

Inspection shows that this happens for 
24 
v. - II (z -i); 

(2-3-5) (5 3 

289 5 3-5)? 
—ͤ 
q 
3 5 
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for J. = x 2, / = O; and 


2 
1 ‘ 2 
II. (2 i) 


for = x(x 1). 

The problem of determining N (m) for a given modulus (see § 6) has its 
exact counterpart in the case of a general modular system in canonical form, 
and is treated by Hensel, loc. cit. 3, and Landsberg, loc. cit. 4. In Lands- 
berg’s terminology, N (m ) is the Norm of the modular system. 


(To be continued.) 
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